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O (57) Abstract: The present invention provides a method of genotyping bovine for improved milk production traits by deteimining 
the DGATl genoiypic state of said bovine, wherein the DGATl gene and polymorphisms within said gene have been found to be 
associated with such improved milk production traits. 
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Marker Assisted Selection of Bovine for Improved Milk Production Using Diacylglycerol 
Acyitransferase gene DGATl 

FIELD OF THE INVENTION 

5 This invention relates to an application of marker assisted selection of bovine for a 
quantitative trait loci (QTL) associated with milk production, particularly although by 
no means exclusively, by assaying for the presence of at least one allele which is 
associated -with increased milk volume as well as improved milk composition, The 
present invention also relates to the gene associated with the QTL, various 
10 polymorphisms within the gene sequence, proteins encoded by these sequences as well 
as to the application of all of these in the farming industry. 

BACKGROUND 

15 The genetic basis of bovine milk production is of immense significance to the dairy 
industry. An ability to modulate milk volumes and content has the potential to alter 
farming practices and to produce products which are tailored to meet a range of 
requirements. In particular, a method of genetically evaluating bovine to select those 
which express desirable traits, such as increased milk production and improved milk 

20 composition, would be desirable. 

To date, bovine genomics are poorly understood and little is known regarding the genes 
which are critical to milk production. While there have been reports of quantitative trait 
loci (QTLs) on bovine chromosome 14 postulated to be associated with milk production 
25 (Coppieters et al (1998)), tiie specific genes involved have not to date been identified. 

Marker assisted selection, which provides the ability to follow a specific favourable 
genetic allele, involves the identification of a DNA molecular marker or markers that 
segregate with a gene or group of genes associated with a QTL. DNA markers have 

30 several advantages. They are relatively easy to measiire and are unambiguous, and as 
DNA markers are co-dominant, heterozygous and homozygous animals can be 
distinctively identified. Once a marker system is established, selection decisions are 
able to be made very easily as DNA markers can be assayed at any time after a DNA 
containing sample has been collected from an individual infant or adult animal, or even 

35 earlier as it is possible to test embryos in vitro if such embryos are collected. 
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The applicants have now identified a gene responsible for the QTL effect on bovine 
chromosome 14 as well as a niimber of polymorphisms which are associated with 
distinct genetic merits of animals for miUc composition and volume. 

5 It is an object of the present invention to provide an application method for marker 
assisted selection of this bovine gene, and in particular, of the polymorphisms in the 
bovine gene which are associated with increased milk volume and altered milk 
composition; and/ or to provide genetic markers for use in such a method; and/ or to 
provide the nucleic acid and amino acid sequences of this gene and encoded 
10 polypeptide; and/ or to provide animals selected using the method of the invention as 
well as milk produced by the selected animals; and/ or to provide the public with a 
useful choice. 

SUMMARY OF THE INVENTION 

15 - 

This invention relates to the discovery of the bovine Diacylglycerol-o-acyitransferase 
(DGATl) gene and polymorphisms within the bovine DGATl gene which are associated 
with increased milk yield and altered milk composition. 

20 More specifically, several polymorphisms in the bovine DGATl gene have been identified 
distinguishing multiple DGATl alleles in different cattle breeds. These polymorphisms 
include: K232A (Bases 6829/30 AA-CG nucleic acid change and K-A amino acid 
change); Nt984+8(Base 7438 A-G nucleic acid change); Nt984+26(Base 7456 C-T 
nucleic acid change); Ntl470+85(Base 8402 C-T nucleic acid change); Ntl9 1+435 (Base 

25 626 T-G nucleic acid change); Ntl91-3321 (Base 3512 T-G nucleic acid change); 
Nt279+144 (Base 4040 T-C nucleic acid change); Nt279+1067 (Base 4963 A-G nucleic 
acid change); Nt279+1107 (Base 5003 G-A nucleic acid change); Nt358 (Base 5997 C-T 
nucleic acid change); Nt754+3 (Base 6892 G-A nucleic acid change); Nt897+32 (Base 
7224/5 GG-AC nucleic acid change); Ntl251+42 (Base 7987 G-A nucleic acid change) 

30 . as summarised in Table 1. In particular, DGATl alleles characterized by the K232A 
mutation have been identified as being associated with an increased milk volume and 
altered milk composition in animals dependent upon whether they are homozygous with 
or without the mutation or heterozygous carrying one mutated allele. More specifically, 
the presence of the K232A mutation results in a decrease in milkfat percentage, milkfat 

35 yield, solid fat content and milk protein percentage, while increasing mdlk volume and 
milk protein yield. 
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The present invention thus relates to the use of the poljnnorphisms in a method of 
identification and selection of a bovine having at least one of said polymorphisms as 
well as to providing markers specific for such identification. Kits comprising said 
markers for use in marker selection also form part of the present invention as do 
5 animals so selected, the milk produced by such selected animals and products 
produced from such milk, particularly as such milk and milk products affect processing 
and/ or health characteristics for consumers. 

In particular, the present invention is directed to a method of genotyping cows or bulls 
10 for one or more of the polymorphisms disclosed herein, selected cows or bulls so 
genotyped and milk and semen from said selected cows and bulls respectively. 

According to a further aspect the present invention is directed to the isolated DGATl 
nucleic acid and allelic nucleic acid molecules comprising polymorphisms as well as to 
15 the proteins encoded thereby and their polypeptide sequences. Antibodies raised 
against said proteins are also contemplated, as are vectors comprising the nucleic acid 
molecules, host cells comprising the vectors; and protein molecules expressed in said 
host cells; and the application of all of them in the farming industry., 

20 In particular, such applications include methods for modulating milk production 
and/ or composition in a lactating bovine by affecting DGATl activity, by reducing the 
activity of DGATl (e.g. by use of specific ribozymes, antisense sequences and/or 
antibodies, or by transgenic technology to produce a "knock out" bovine and/ or bovine 
with introduced transgenes containing the DGATl gene and/ or variations of this gene 

25 driven by various promoters). 

BRIEF DESCRIPTION OF THE DRAWINGS 

The invention will now be described with reference to the Figures of the accompanying 
30 drawings in which: 

Figiirel: Shows a BAG contig spanning the BULGE 13-BULGB09 interval relative to a 
schematic diagram of bovine chromosome 14 and a schematic diagram showing the 
location of the genetic markers. The most likely position of the QTL is shown as a bar 
35 . on the FISH-anchored linkage map proximal to BTA14q. The BACs composing the 
contigs spanning the EULGE13-BULGE09 interval are shown as a series of horizontal 
lines. The symbols on each BAG indicate their individual STS content: solid circles 
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correspond to STS derived from BAG ends, open boxes to microsatellite markers, and 
solid triangles to gene-specific Comparative Anchored Tagged Sequences. The arrow 
heads mark the BACs from which the respective BAG end STS were derived. The length 
of the lines do not reflect the actual insert size of the corresponding BACs. The BAG 
5 contig was aligned with the orthologous himian HSA8q24.3 genomic "golden path" 
sequence represented according to the Ensembl Human Genome Server 
(http: / /www.ensembLorg/) : individual sequence contigs are shown in alternating light 
and dark; a horizontal line indicates a gap in the sequence assembly; genetic markers 
are indicated under the contig map; the lines and boxes above the contig map represent 
10 "curated", "predicted known** or "predicted noveF genes. 

Figures 2a and 2b: Show the genomic sequence of the bovine DGATl gene. Figure 2a 
is the 31 base pair sequence upstream but adjacent to the ATG or translation start site 
and is 5'UTR. Figure 2b is the genomic sequence in the bovine DGATl gene from the 
15 ATG translation start site (base 1) through to genomic sequence flanking the gene at the 
3' end. The significant features including intron/exon boundaries, poljrmorphic sites, 
polyadenylation signal, and alternate splicing site and some of the primer sequences 
used in the assays described herein, are indicated; 

20 Figure 3: Shows the genomic organization, four polymorphisms and haplotypes found 
in the bovine DGATl gene. Leader and trailer sequences are shown in light grey, coding 
sequences in dark grey and intronic sequences as a hollow line. The positions of four of 
the identified polymorphisms are marked as shown on the gene, and detailed in the 
underlying boxes including the corresponding sequence traces All the sequence 

25 variations are summarised in Table 1. The four DGATl haplotypes which were found in 
the Dutch and New-Zealand Holstein-Friesian population as defined by these 
polymorphisms are shown and referred to as "sH^^", 'sHQ-^" , "sHQ-^" for the fat 
increasing haplotypes and "sh9* for the fat decreasing haplotype; 

30 Figure 4a: Shows the corresponding full length amino acid sequence for DGATl 
sequence of Figure 2b including annotation of the amino acid substitution; 

Figure 4b: Shows the amino acid sequence predicted as a result of alternate splicing 
with exon VIII; 

35 

Figure 5: Shows the multiple peptide alignment of a portion of the DGATl protein 
flanking the K232A substitution from Bos tauruSf Bison bison, Ouis ones, Sus scrofa. 
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Homo sapiens, Cercopithecus aethiopSy Mus mvscahis domesticus and Rattus noruegicus 
showing the evolutionary conservation of the lysine mutated in the bovine K232A 
polymorphism; 

5 Figure 6: A. Shows , the frequency distribution of observed DGATl SNP haplolypes in 
the Dutch and New Zealand Holstein-Friesian dairy cattle populations. B^D. Shows the 
frequency distribution of the combined microsatellite (BULGE09-BULGE1 1) and SNP 
DGATl haplotypes. The HQ-^ and fJ?-^haplotypes are shown; and 

10 Figiure 7: Shows the lod score due to LD when including {+) or excluding {-) the four 
DGATl polymorphisms shown in Figure 3 in a combined linkage and LD multipoint 
maximum likelihood mapping method. The lod score corresponds to the logio of the 

ratio between the likelihood of the data assuming LD and linkage between the markers 
and the likelihood of the data assuming linkage in the absence of LD. The positions of 
15 the microsatellites and SNP markers utilized in the analysis are shown on the X-axis, 
while the position of the DGATl SNPs is marked by a red arrow at the top of the figure. 

DETAILED DESCRIPTION OF THE INVENTION 

20 It has been discovered for the first time that the DGATl gene in bovine is associated 
with the QTL on chromosome 14 which is linked with improved milk production traits. 
More particularly, a number of novel polymorphisms on the DGATl gene have been 
discovered. It is thought that one or more of these polymorphisms is responsible for 
these traits. 

25 

The method used for isolating genes which cause specific phenotypes is known as 
positional candidate cloning. It involves: (i) the chromosomal localisation of the gene 
which causes the specific phenotype using genetic markers in a linkage analysis; and 
(ii) the identification of the gene which causes the specific phenotype amongst the 
30 "candidate" genes known to be located in the corresponding region. Most of the time 
these candidate genes are selected from available mapping information in humans and 
mice. 

The tools required to perform the initial localisation (step (i) above) are microsatellite 
, 35 marker maps, which are available for livestock species and are found in the public 
domain (Bishop et al., 1994; Barendse et al, 1994; Georges et a/., 1995; and Kappes, 
1997), The tools required for the positional candidate cloning, particularly the BAG 
libraries, (step (ii) above) are partially available from the public domain. Genomic 
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libraries with large inserts constructed with* Bacterial Artificial Chromosomes (BAG) are 
available in the public domain for most livestock species including cattle. For general 
principles of positional candidate cloning, see Collins, 1995 and Georges and Anderson, 
1996, 

5 

Recently, a quantitative trait locus (QTL) with major effect on milk solids composition, 
located at the centromeric end of bovine chromosome 14, hsis been reported (Coppieters 
et aly (1998)). This QTL was shown to effect milk fat content and in particular to 
significantly affect protein %, volume, protein yield and fat yield of milk. The linkage 
10 study as well as subsequent marker assisted segregation analyses allowed for the 
identification of thirteen Holstein-Friesian sires predicted to be heterozygous "Qg" for 
the corresponding QTL (Coppieters et al, (1998); Riquet et al, (1999)). 

Linkage disequilibrium methods were applied to refine the map position of the QTL to a 
15 « 5 cM interval bounded by microsatellite markers BULGE09 and BULGE30. 

A bovine DGATl nucleotide sequence was determined by the applicants and is shown in 
Figures 2a and 2b with the corresponding amino acid sequences (long and short forms) 
being shown in Figures 4a and 4b respectively. Table 1 sets out all the polymorphisms 
20 located to date with reference to the sequence in Figure 2b. Some of the genetic 
polymorphisms identified in the bovine DGATl gene are reported in Figure 3. The 
nucleic acid and protein sequences of the DGATl alleles including the K232A mutation 
are shown in Figures 2a and 2b (SEQ ID NOs: 3 and 1), aimotated to show the 
alternatively spliced forms. The cDNA sequence is also set out in SEQ ID NO: 4. 

25 

The sequence information in the Figures gives rise to numerous, and separate, aspects 
r of the invention. 

In one aspect, the invention provides a method of determining genetic merit of a bovine 
30 with respect to milk composition and volume which comprises the step of determining 
the bovine DGATl genotypic state of said bovine. In particular, this method is useful 
for genotyping and selecting cows and bulls having the desired genotypic state so that 
milk and semen may be collected from said cows and bulls respectively. Such semen 
would be useful for breeding purposes to produce bovine having the desired genotypic 
35 and, as a result, phenotypic state. In addition, cows genotyped by the methods of the 
present invention are also useful for breeding purposes, particularly for breeding with 
the selected bulls and/ or to be artificially inseminated with the semen from selected 
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bulls. The embryos and offspring produced by such cows also form part of the present 
invention. 

In one embodiment, the genotypic state is determined with respect to DNA obtained 
5 from said bovine. 

Alternatively, said genotypic state is determined with reference to mRNA obtained frpm 
said bovine. 

10 In yet a further embodiment, the genotypic state is determined with reference to the 
amino acid sequence of expressed bovine DGATl protein obtained from said bovine. 

Conveniently, in said method, the genotypic state of DNA encoding bovine DGATl is 
determined, directly or indirectly. 

15 

Alternatively, in said method the genotypic state of at least one nucleotide difference 
from the nucleotide sequence encoding bovine DGATl is determined, directly or 
indirectly. 

20 More specifically, in said method the genotypic state of bovine DGATl allele(s) 
characterised by one or more of the polymorphisms shown in Table 1 below, is 
determined, directly or indirectly. 
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Table 1: 

Table of polymorphisms in the bovine DGATl gene 

5 Start codon (atg); the a residue is denoted as position 1 



Base number relative to 
exonic sequence* 


Nucleotide distance 
firom start 


substitution 


Intron/exon 


Nt 191 + 435 


626 


T-G 

CAGTGCTAGGGG 
CAGTGCGAGGGG 


Intron 1 


Nt 191 + 3321 


3512 


T-G 

GCATTGCGCT 
GCATGGCGCT 


Intron 1 


Nt 279 + 144 


4040 


T-C 

TACCCTGGGAC 
TACCCCGGGAC 


Intron 2 


Nt 279 +1067 


4963 


A-G 

CTCTTAGCAGC 
CTCTTGGCAGC 


Intron 2 


Nt 279 +1107 


5003 


G-A 

ACAGGCAACT 
ACAGACAACT 


Intron 2 


Nt358 


5997 


C-T 

TGTCTCTGTTC 
TGTCTTTGTTC 


ExonlV 


Nt 692 


6829 


AA-GC 

GGTAAGAAGGCCAA (Q) 
GGTAAGGCGGCCAA (q» 


K232A Exon 

vin * 


Nt 754 +3 


6892 


G-A 

GCGGTGAGGAT 
GCGGTAAGGAT 


Intron VIII 


Nt 897 +32 


7224 


GO-AC 

GGGGGGGGGGGACTCT 
GGGGGACGGGGACTCT 


' Intron X 


Nt 984 +8 


7438 


A-G 

GAGTGACCTGC 

GAGTGGCCTGC 


Intron XII * 


Nt 984 +26 


7456 


C-T 

GGACGCGTGGG 
GGACGTGTGGG 


Intron Xn * 


Ntl251 +42 


7987 


G-A 

GGTGGGGGTGG 
GGTGGAGGTGG 


Intron XV 


Nt 1470 +85 . 


8402 


C-T 

CTGGGCGCAGC 
CTGGGTGCAGC 


3* flanking 
region * 



The numbers given are for the actual nucleotide or in the case of two nucleotide substitutions to the first nucleotide 
in the variation (counting 5' to 31 
1 0 *More detail of these polymorphisms is given in Figure 2b. 

J e.g. Nt 191 represents nucletode number 191 from the start site of the coding sequence, + 435 represents number 
of nucleotides from and including base 192 in the genomic sequence (intron 1) to the pofymorphic nucleotide 
The polymorphic nucleotides are shaded 



15 Preferably, the invention is directed to a method of determining the genotypic state of 
bovine DGATl allele(s) by determining the presence of the K232A polymorphism, either 
directly or indirectly. 



There are numerous art standard methods known for determining whether a particular 
20 DNA sequence is present in a sample. An example is the Polymerase Chain Reaction 
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(PGR). A preferred aspect of the invention thus includes a step in which ascertaining 
whether a polymorphism(s) in the sequence of. DGATl DNA is present, includes 
amplifying the DNA in the presence of primers based on the nucleotide sequence of the 
DGATl gene and flanking sequence, and/or in the presence of a primer containing at 
5 least a portion of a polymorphism as disclosed herein and which when present results 
in altered relative milk lipid and protein production, and milk volume. 

A primer of the present invention, used in PGR for example, is a nucleic acid molecule 
sufficiently complementary to the sequence on which it is based and of sufficient length 

10 to selectively hybridise to the corresponding portion of a nucleic acid molecule intended 
to be amplified and to prime synthesis thereof under in vitro conditions commonly used 
in PGR. Likewise, a probe of the present invention, is a molecule, for example a nucleic 
acid molecule of stif&dent length and sufiiciendy complementary to the nucleic acid 
molecule of interest, which selectively binds under high or low stringency conditions 

15 with the nucleic acid sequence of interest for detection thereof in the presence of nucleic 
acid molecules having differing sequences. 

In another aspect, the invention provides a method for determining the genetic merit of 
bovine with respect to milk content and volume with reference to a sample of material 
20 containing mRNA obtained from the bovine. This method includes ascertaining whether 
a polymorphism(s) in the sequence of the mRNA encoding DGATl is present. The 
presence of such polymorphisms again indicates an association with altered relative 
milk lipid and protein production and milk volume. 

25 Again, if an amplification method such as PGR is used in ascertaining whether a 
polymorphism(s) in the sequence of the mRNA encoding (DGATl) is present, the method 
includes reverse transcribing the mRNA using a reverse transcriptase to generate a 
cDNA and then amplifying the cDNA in the presence of a pair of primers complementary 
to a nucleotide sequence encoding a protein having biological activity of wild type 

30 DGATl. 

In a further aspect,, the invention includes the use of a probe in the methods of 
genotyping according to the invention wherein the probe is selected from any 5 or more 
contiguous nucleotides of the DGATl sequence as shown in Figure 2b, which is 
35 therefore sufficiently complementary with a nucleic acid sequence encoding such bovine 
DGATl, or its complement, so as to bind thereto under stringent conditions. Diagnostic 
kits containing such a probe are also included. Such probes may be selected from 
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ForAA (FAM): CGTTGGCCTTCTTA or DgatADGC (VIC): TTGGCCGCCTTACC. (SEQ ID 
NOs: 20 and 21 respectively.) 

The invention further includes isolated nucleic acid molecules encoding the DGATl 
5 variant proteins i.e. those proteins encoded by SEQ ID NOs: 1 and 4 (Figure 2b), with or 
without one or more polymorphisms of SEQ. ID Nos: 7 to 19 (Table 1), or a fragment or 
variant thereof. Particularly, the invention includes an isolated nucleic acid molecule 
comprising a DNA molecule having in whole or in part the nucleotide sequence 
identified in Figure 2b or which varies from the sequence due to the degeneracy of the 
10 genetic code, or a nucleic acid strand capable of hybridising with said nucleic acid 
molecule under stringent hybridisation conditions. 

The invention includes isolated mRNA transcribed from DNA having a sequence which 
corresponds to a nucleic add molecule of the invention. 

15 

The invention includes isolated DNA in a recombinant cloning vector and a prokaiyotic 
or eukaryotic cell containing and expressing heterologous DNA of the invention. 

The invention includes a transfected cell line which expresses a protein encoded by the 
20 nucleic acid molecules of the invention. 

The invention also includes a primer composition useful for detection of the presence of 
DNA encoding DGATl and/or the presence of DNA encoding a variant protein. In one 
form, the composition can include a nucleic acid primer substantially complementary to 
25 a nucleic acid sequence encoding DGATl. The nucleic acid sequence can in whole or in 
part be that identified in Figure 2b. Diagnostic kits including such a composition are 
also included. 

The invention further provides a diagnostic kit useful in detecting DNA encoding a 
30 variant DGATl protein in bovine which includes first and second primers for amplifying 
the DNA, the primers being complementary to nucleotide sequences of the DNA 
upstream and downstream, respectively, of a polymorphism in the portion of the DNA 
encoding DGATl which results in altered relative milk lipid, solid fat content and 
protein production and milk volume, wherein at least one of the nucleotide sequences is 
35 selected to be from a non-coding region of the DGATl gene. The.kit can also include a 
third primer complementary to a polymorphism, disclosed herein, located on the 
DGATl gene. 
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The invention includes a process for producing a protein of the invention, including 
preparing a DNA fragment including a nucleotide sequence which encodes the protein; 
incorporating the DNA fragment into an expression vector to obtain a recombinant DNA 
5 molecule which includes the DNA fragment and is capable of undergoing replication; 
transforming a host cell with the recombinant DNA molecule to produce a transformant 
which can express the protein; culturing the transformant to produce the protein; and 
recovering the protein from resulting cultured mixture. 

10 Thus in a further aspect, the invention provides a purified protein having biological 
activity of DGATl. The terms "isolated" and "purified'' as used herein, each refer to a 
protein substantially free of cellular material or culture medium when produced by 
recombinant DNA techniques, or chemical precursors or other chemicals when 
chemically synthesised. In certain preferred embodiments, the protein having biological 

15 activity of DGATl comprises an amino acid sequence and variants shown in Figures 4a 
and 4b (SEQ ID NOs: 2, 5 and 6). Furthermore, proteins having biological activity of 
. DGATl that are encoded by nucleic adds which hybridise under stringent conditions to 
a nucleic acid comprising a nucleotide sequence shown in Figure 2b (SEQ ID NOs: 1 
and 4) are encompassed by the invention. ' 

20 

Proteins of the invention having DGATl activity can be obtained by expression of a 
nucleic acid coding sequence in a suitable host cell using techniques known in the art. 
Suitable host cells include prokaryotic or eukaryotic organisms or cell lines, for 
example, yeast, E, coli, insect cells and COSl cells. The recombinant expression vectors 

25 of the invention can be used to express a protein having DGATl activity in a host cell in 
order to isolate the protein. The invention provides a method of preparing a purified 
protein of the invention comprising introducing into a host cell a recombinant nucleic 
acid encoding the protein, allowing the protein to be expressed in the host cell and 
isolating and purifying the protein. Preferably, the recombinant nucleic acid is a 

30 recombinant expression vector. Proteins can be isolated from a host cell expressing the 
protein and purified according to standard procedures of the art, including ammonium 
sulfate precipitation, column chromatography (eg. ion exchange, gel filtration, affinity 
chromatography, etc.) electrophoresis, and ultimately, crystallisation (see generally 
"En2yme Purification and Related Techniques". Methods in Enzyjnology, 22, 233-577 

35 (1971)). 
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Alternatively, t±Le protein or parts thereof can be prepared by chemical synthesis using 
techniques well known in the chemistry or proteins such as solid phase synthesis 
(Merrifield, 1964), or synthesis in homogeneous solution (Houbenwcyl, 1987). 

5 It will of course be understood that a variety of substitutions of amino acids is possible 
while preserving the structure responsible for activity of the DGATl proteins disclosed 
herein. Conservative substitutions are described in the patent literature, as for 
example, in United States Patent No 5,264,558 or 5,487,983. It is thus expected, for 
example, that interchange among non-polar aliphatic neutral amino acids, glycine, 

10 alanine, proline, valine and isoleucine, would be possible. Likewise, substitutions 
among the polar aliphatic neutral amino acids, serine, threonine, methionine, 
asparagine and glutamine could possibly be made. Substitutions among the charged 
acidic amino acids, aspartic acid and glutamic acid, could probably be made, as could 
substitutions among the charged basic amino acids, lysine and arginine. Substitutions 

15 among the aromatic amino acids, including phenylalanine, histidine, tryptophan and 
tyrosine would also likely be possible. These sorts of substitutions and interchanges 
are well known to those skilled in the art. Other substitutions might well be possible. 
Of course, it would also be expected that the greater percentage of homology ie. 
sequence similarity, of a variant protein with a naturally occurring protein, the greater 

20 the retention of activity. 

A further advantage may be obtained through chimeric forms of the proteins, as known 
in the art. A DNA sequence encoding each entire protein, or a portion of the protein, 
cQuld be linked, for example, with a sequence coding for the C-terminal portion of E. coli 
25 P-galactosidase to produce a fusion protein. 

The proteins of the invention, or portions thereof, have numerous applications in turn. 
By way of example, each protein can be used to prepare antibodies which bind to a 
distinct epitope in an unconserved region of the protein. An unconserved region of the 
30 protein is one which does not have substantial sequence homology to other proteins. 

Still further, the invention includes an antibody to a bovine DGATl variant protein 
encoded by a nucleotide sequence of the present invention as well as a diagnostic kit 
containing such an antibody. 

35 

Conventional methods can be used to prepare the antibodies.. For example, by using a 
DGATl peptide, polyclonal antisera or monoclonal antibodies can be made using 
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Standard methods. A mammal, (eg. a mouse, hamster, or rabbit) can be immunised 
with an immunogenic form of the peptide which elicits an antibody response in the 
mammal. Techniques for conferring immunogenicity on a peptide include conjugation 
to carriers or other techniques well known in the art. For example, the peptide can be 
5 administered in the presence of adjuvant. The progresis of immunisation can be 
monitored by detection of antibody titers in plasma or serum. Standard ELISA or other 
immunoassay can be used to assess the levels of antibodies. Following immunisation, 
antisera can be obtained and, if desired, polyclonal antibodies isolated from the sera. 

10 To produce monoclonal antibodies, antibody producing cells (Ijnmphocytes) can be 
harvested from an immunised animal and fused with myeloma cells by standard 
somatic cell fusion procedures, thus immortalising these cells and yielding hybridoma 
cells. Such techniques are well known in the art. For example, the hybridoma 
technique originally developed by Kohler and Milstein (Kohler, J975) as well as other 

15 techniques such as the human B-cell hybridoma technique (Kozbor, 1983) and 
screening of combinatorial antibody libraries (Huse, 1989). Hybridoma cells can be 
screened immunochemically for production of antibodies specifically reactive with the 
pepdde, and monoclonal antibodies isolated. 

20 The term "antibody as used herein is intended to include fragments thereof which are 
also specifically reactive with the target protein. Antibodies can be fragmented using 
conventional techniques and the fragments screened for utility in the same manner as 
described above for whole antibodies. For example, F{ab')2 fragments can be generated 
by treating antibody with pepsin. The resulting F(ab')2 fragment can be treated to 

25 reduce disulfide bridges to produce Fab' fragments. 

Another method of generating specific antibodies, or antibody fragments, reactive 
against the target proteins is to screen expression libraries encoding immunoglobulin 
genes, or portions thereof, expressed in bacteria, with peptides produced from the 

30 nucleic acid molecules of the present invention. For example, complete Fab fragments, 
VH regions and FV regions can be expressed in bacteria using phage expression 
libraries. See for example Ward et ah, Huse et al, and McCafferty et al (Ward, 1989); 
Huse 1989; McCafferty, 1990). Screening such libraries with, for example, a DGATl 
protein can identify immunoglobulin fragments reactive with that DGATl. Alternatively, 

35 the SCID-hu mouse developed by Genpharm can be used to produce antibodies, or 
fragments thereof. - 
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The polyclonal, monoclonal or chimeric monoclonal antibodies can be used to detect the 
proteins of the invention, portions thereof or closely related isoforms in various 
biological materials. For example, they can be used in an ELISA, radioimmunoassay or 
histochemical tests. Thus, the antibodies can be used to quantify the amount and 
5 location of a DGATl protein of the invention, portions thereof or closely related isoforms 
in a sample in order to determine the role of DGATl proteins. Using methods described 
hereinbefore, polyclonal, monoclonal antibodies, or chimeric monoclonal antibodies can 
be raised to non-conserved regions of DGATl and used to distinguish a particular 
DGATl from other proteins. 

10 

The polyclonal or monoclonal antibodies can be coupled to a detectable substance or 
reporter system. The term "coupled" is used to mean that the detectable substance is 
physically linked to the antibody. Suitable detectable substances include various 
enzymes, prosthetic groups, fluorescent materials, luminescent materials and 

15 radioactive materials. Examples of suitable enzymes include horseradish peroxidase, 
alkaline phosphatase, p-galactosidase, and acetylcholinesterase; examples of suitable 
prosthetic group complexes include streptavidin/biotin and avidin/biotin; examples of 
suitable fluorescent materials include umbelliferone, fluorescein, fluorescein 
isothiocyanate, rhodamine, dichlorotriazinyiamine fluorescein, dansyl chloride and 

20 phycoerythrin; an example of a luminescent material includes luminol; and examples of 
suitable radioactive material include i^sjj laij^ 355 and ^H. In a preferred embodiment, 
the reporter system allows quantitation of the amount of protein (antigen) present. 

Such an antibody-linked reported system could be used in a method for determining 
25 whether a fluid or tissue sample of a bovine contains a deficient amount or an excessive 
amount of the relevant DGATl protein. Given a normal threshold concentration of such 
a protein, test kits can be developed. 

The availability of such antibodies gives rise to further applications. One is a diagnostic 
30 kit for identifying cells comprising an antibody (such as a monoclonal antibody) which 
binds to a protein comprising an amino acid sequence shown in Figure 4a and 4b; 
means for detecting the antibody when bound to the protein, unreacted protein or 
unbound antibody; means for determining the amount of protein in the sample; and 
means for comparing the amount of protein in the sample with a standard. In some 
35 embodiments of the invention, the detectability of the antibody which binds to a specific 
DGATl protein is activated by the binding (eg. change in fluorescence spectrum, loss of 



wo 02/36824 



PCT/NZOl/00245 



15 

radioisotopic label). The diagnostic kit can also contain an instruction manual for use 
of the kit. 

Antibody-based diagnostics are of course not the only possibility. A further diagnostic 
5 kit comprises a nucleotide probe complementary to the sequence, or an oligonucleotide 
fragment thereof, shown in Figure 2a and 2b, for example, for hybridisation with mRNA 
from a sample of cells; means for detecting the nucleotide probe bound to mRNA in the 
sample with a standard. In a particular aspect, the kit of this aspect of the invention 
includes a probe having a nucleic acid molecule sufficiently complementary with a 

10 sequence identified in Figure 2a and 2b, or its complement, so as to bind thereto under 
stringent conditions. ''Stringent hybridisation conditions" takes on its common 
meaning to a person skilled in the art. Appropriate stringency conditions which 
promote nucleic acid hybridisation, for example, 6x sodium chloride/ sodium citrate 
(SSC) at about A-S°C are known to those skilled in the art, including in Current 

15 Protocols in Molecular Biology, John Wiley & Sons, NY (1989). Appropriate wash 
stringency depends on degree of homology eind length of probe. If homology is 100%, a 
high temperature (65°C to 75°C) may be used. If homology is low, lower wash 
temperatures must be used. However, if the probe is veiy short (<100bp), lower 
temperatures must be used even with 100% homology. In general, one starts washing 

20 at low temperatures (37oC to 40oC), and raises the temperature by S-S^C intervals until 
background is low enough not to be a major factor in autoradiography. The diagnostic 
kit can also contain an instruction manual for use of the kit. 

One of the major applications of the present invention is in the marker assisted 
25 selection of bovines having a polymorphism in the DGATl gene and which are 
associated with improved milk production traits. The invention therefore provides a 
diagnostic kit which can be used to determine the DGATl genotype of bovine genetic 
material, for example. One kit includes a set of primers used for amplifying the genetic 
material. A kit can contain a primer including a nucleotide sequence for amplifying a 
30 region of the genetic material containing one of the polymorphisms described herein. 
Such a kit could also include a primer for amplifying the corresponding region of the 
normal DGATl gene, i.e. the sequence without poljnxiorphisms. Usually, such a kit 
would also include, another primer upstream or downstream of the region of interest 
complementary to a coding and/or non-coding portion of the gene. These primers are 
35 used to amplify the segment containing the mutation, i.e. polymorphism, of interest. 
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In particular, the invention is directed to the use of the polymorphisms in the DGATl 
gene in the genotyping of cows and bulls as well as to cows and bulls selected by such 
genotyping which have one or more of said polymorphisms in the DGATl gene. Such 
bulls so selected are of valuable breeding stock and the invention is also directed to the 
5 semen produced by such selected bulls for breeding purposes. Cows so selected are 
also useful as breeding stock as are their offspring. In addition, such cows may 
produce valuable dairy herds as the milk produced by such cows is produced in greater 
volumes than equivalent non- selected cows, and/ or has an altered composition in that 
it comprises less milkfat and more milk protein. Such milk and products made 
10 therefrom also form part of the invention. It is also noted that the milk from these 
selected cows will be valuable as the fat content is not only decreased but is also 
characterised by being softer. Without being bound by theory, it is thought that this 
increased fat softness is due to the fatty acid composition being such that there is less 
saturated and more unsaturated fat in the milk of selected cows. Thus it is anticipated 
15 • that products made from such milk will have processing advantages, such as in the 
production of more spreadable butter, as well as having a health benefit on consumers, 
as generally unsaturated fats are considered to be more "healthy" than saturated fats. 
The protein composition of milk produced by such selected cows is also altered. In 
particular, such milk comprises an altered protein yield compared to milk for non- 
20 selected cows and the casein:whey ratio is also altered which makes such milk valuable . 
for cheese production. 

Thus, the present invention involves genotyping bovine, both cows and bulls, for the 
DGATl polymorphisms disclosed herein, selected cows and bulls so genotyped, milk 
25 and semen produced by the selected cows and bulls so genotyped, offspring produced 
by the selected bovine, including embryos and cells (including cell lines) useful for 
cloning said selected bovine. 

The actual genotyping is carried out using primers that target specific polymorphisms 
30 as described herein and that could function as allele- specific oligonucleotides in 
conventional hybridisation, Taqman assays, OLA assays, etc. Alternatively, primers can 
be designed to permit genotyping by micro sequencing. 

One kit of primers can include first, second and thfrd primers, (a), (b) and (c), 
35 respectively. Primer (a) is based on a region containing a DGATl mutation such as 
described above. Primer (b) encodes a region upstream or downstream of the region to 
be amplified by primer (a) so that genetic material containing the mutation is amplified, 
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by PGR, for example, in the presence of the two primers. Primer (c) is based on the 
region corresponding to that on which primer (a) is based, but lacking the mutation. 
Thus, genetic material containing the non-mutated region will be amplified in the 
presence of primers (b) and (c). Genetic materisd homozygous for the DGATl gene will 
5 thus provide amplified products in the presence of primers (b) and (c). Genetic material 
homozygous for the mutated gene will thus provide amplified products in the presence 
of primers (a) and (b). Heterozygous genetic material will provide amplified products in 
both cases. 

10 The present invention also contemplates the modulation of miUc production and content 
in non-human animals by modulating the activity of the DGATl protein. In particular, 
this aspect of the invention includes a method of modulating milk production and/ or 
milk content in a lactating bovine, the method comprising administering to the bovine 
an effective amount of a nucleic acid molecule substantially complementsiry to at least a 

15 portion of mRNA encoding the bovine DGATl variant proteins and being of sufficient 
length to sufficiently reduce expression of said DGATl, i.e. by use of antisense nucleic 
acids. 

Antisense nucleic acids or oligonucleotides (RNA or preferably DNA) can be used to 
20 inhibit DGATl production in a bovine if this is considered desirable e.g. in order to 
produce a bovine capable of improved milk production, i.e. increased milk volume and 
decreased milkfat content. Antisense oligonucleotides, typically 15 to 20 bases long, 
bind to the sense mRNA or pre mRNA region coding for the protein of interest, which 
can inhibit translation of the bound mRNA to protein. The cDNA sequence encoding 
25 DGATl can thus be used to design a series of oligonucleotides which together span a 
large portion, or even the entire cDNA sequence. These oligonucleotides can be tested 
to determine which provides the greatest inhibitory effect on the expression of the 
protein (Stewart 1996). The most suitable mRNA target sites include 5'- and 3- 
untranslated regions as well as the initiation codon. Other regions might be found to be 
30 more or less effective. 

Alternatively, an antisense nucleic acid or oligonucleotide may bind to DGATl coding 
sequences- 

35 In yet another embodiment, the invention provides a method of modulating milk 
production , and/ or milk content in a lactatirig bovine, including administering to the 
bovine an effective amount of a nucleic acid molecule having ribozyme activity and a 
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nucleotide sequence substantially complementary to at least a portion of mRNA 
encoding a bovine DGATl and being of sufficient, length to bind selectively thereto to 
sufEcientty reduce expression of said DGATl. 

5 Rather than reducing DGATl activity in the bovine by inhibiting gene expression at the 
nucleic acid level, activity of the relevant DGATl protein may be directly inhibited by 
binding to an agent, such as, for example, a suitable small molecule or a monoclonal 
antibody. 

10 Thus, the invention also includes a method of inhibiting the activity of bovine DGATl in 
a lactating bovine so as to modulate milk production and/or milk solids content, 
comprising administering an effective amount of an antibody to the relevant DGATl. 

The invention still further includes a method of modulating milk production and/ or 
15 milk solids content by raising an autoantibody to a bovine DGATl in the bovine. 
Raising the autoantibody can include administering a protein having DGATl activity to 
the bovine. 

In still a further embodiment, nucleic acids which encode DGATl proteins can be used 
20 to generate transgenic animals. A transgenic animal (eg. a mouse) is an animal having 
ceils that contain a transgene, which transgene is introduced into the animal or an 
ancestor of the animal at a prenatal, eg. an embryonic stage. A transgene is DNA which 
is integrated into the genome of a cell from which a transgenic animal develops. In one 
embodiment, a bovine cDNA, comprising the nucleotide sequence shown in Figure 2b, 
25 or an appropriate variant or subsequence thereof, can be used to generate transgenic 
animals that contain cells which express the relevant DGATl . Likewise, variants can be 
used to generate transgenic animals. "Knock out" animals can also be generated. 

Methods for generating transgenic animals, particularly animals such as mice, have 
30 become conventional in the art are described, for example, in US Patent Nos. 4,736,866 
and 4,870,009. In such methods, plasmids containing recombinant molecules are 
microinjected into mouse embryos. In particular, the plasmids can be microinjected 
into the male pronuclei of fertilised one-cell mouse eggs; the injected eggs transferred to 
pseudo-pregnant foster females; and the eggs in the foster females allowed to develop to 
35 term. (Hogan, 1986). Alternatively, an embryonal stem cell can be transfected with an 
expression vector comprising nucleic acid encoding a DGATl protein, and cells 
containing the nucleic acid can be used to form aggregation chimeras with embryos 
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from a suitable recipient mouse strain. The chimeric embryos can then be implanted 
into a suitable pseudopregnant female mouse of the appropriate strain and the embryo 
brought to term. Progeny harbouring the transfected DNA in their germ cells can be 
used to breed uniformly transgenic mice. 

5 

Such animals could be used to determine whether a sequence related to an intact 
DGATl gene retains biological activity of the encoded DGATl. Thus, for example, mice 
in which the murine DGATl gene has been knocked out and containing the nucleic acid 
sequence identified in Figure 2b or fragment or variant thereof could be generated. The 
10 animals could be examined with reference to milk production and content. 

The pattern and extent of expression of a recombinant molecule of the invention in a 
transgenic mouse is facilitated by fusing a reporter gerie to the recombinant molecule 
such that both genes are" co-transcribed to form a polycistronic mRNA. The reporter 

15 gene can be introduced into the recombinant molecule using conventional methods 
such as those described in Sambrook et al, (Sambrook, 1989). Efficient expression of 
both cistrons of the polycistronic mRNA encoding the protein of the invention and the 
reporter protein can be achieved by inclusion of a known internal translational 
initiation sequence such as that present in poliovirus mRNA. The reported gene should 

20 be under the control of the regulatory sequence of the recombinant molecule of the 
invention and the pattern and extent of expression of the gene encoding a protein of the 
invention can accordingly be determined by assaying for the phenotype of the reporter 
gene. Preferably the reporter gene codes for a phenotype not displayed by the host cell 
and the phenotype can be assayed quantitatively. Examples of suitable reporter genes 

25 include lacZ (P-galactosidase), neo (neomycin phosphotransferase)^ CAT 
(chloramphenicol acetyltransferase) dhfr (dihydrofolate reductase), aphlV (hygromycin 
phosphotransferase), lux (luciferase), uidA (^-glucuronidase). Preferably, the reporter 
gene is lacZ which codes for p-galactosidase. p-galactosidase can be assayed using the 
lactose analogue X-gal (5-bromo-4-chloro-3-indolyl-p-D-galactopyranoside) which is 

30 broken down by p-galactosidase to a product that is blue in colour. 

Still fuither transgenic applications of the invention arise from knocking out the 
endogenous gene encoding DGATl in mammals and replacing this with a transgene, in 
order to obtain a desired effect. This is particularly true in cattle raised for milk 
35 production. For example, additional copies of the gene encoding DGATl can be inserted 
as a transgene, or the endogenous gene associated with a high level expression 
promoter in a transgene. It may also prove advantageous to substitute a defective gene 
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rather than delete the entire sequence of DNA encoding for a protein having DGATl 
activity. A method of producing a transgenic bovine or transgenic bovine embryo is 
described in United States Patent No. 5,633,076, issued 27 May 1997, for example. 

5 These transgenic animals of the invention can again be used to investigate the 
molecular basis of DGATl action. For example, it is expected that mutants in which 
one or more of the conserved cysteine residues has been deleted wotild have diminished 
activity in relation to a DGATl protein in which all such residues are retained. Further, 
deletion of a proteolytic cleavage site would likely result in a mutant lacking biological 
10 activity of DGATl. 

Transgenic animals of the invention can also be used to test substances for the ability 
to prevent, slow or enhance DGATl activity. A transgenic animal can be treated with 
the substance in parallel with an untreated control transgenic animal. Substances 
15 which could be tested in this way include proteins extracted from foods ingested by the 
animal. For example, proteins extracted from pastoral grasses and other fodder can be 
tested to determine their effect on DGATl activity, including to determine whether 
breed-specific effects can be induced. 

20 Thus, in further aspects, the invention provides transgenic non-human animals. These 
include by way of example only a transgenic bovine having a genome lacking a gene 
encoding a protein having biological activity of DGATl (or indeed any DGATl activity at 
all); a transgenic mouse having a genome containing a gene encoding a bovine protein 
having biological activity of any DGATl; and a transgenic bovine having a gene 

25 encoding a bovine protein having biological activity of a bovine DGATl and heterologous 
nucleotide sequence antisense to the gene. The transgenic bovine can include a gene 
encoding a nucleic acid sequence having ribozyme activity and in transcriptional 
association with the nucleotide sequence antisense to the gene. 

30 The invention further provides a transgenic bovine having a genome which includes 
additional copies of a gene encoding a^protein having biological activity of DGATl or 
copies of a gene encoding a protein having biological activity of DGATl under control of 
a high expression promoter. 

35 These are but a selection of the applications of this invention. Others will be apparent 
to those persons skilled in this art and are in no way excluded. To the contrary, the 
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invention extends to cover not only the specific teaching provided but also all variations 
and modificatiohs which are within the skill and contemplation of the addressee. 

The invention will now be defined by specific examples which are illustrative only and 
5 are not intended to limit the invention in any way. 

EXPERIMENTAL 

1. Location of the gene responsible for the observed QTL 

10 

Construction of a BAC contig spanning the BULGE9-BULGE30 interval 
In order to clone the gene(s) responsible for the observed QTL effect, a BAC cont^ 
spanning the corresponding marker interval was constructed. This was accomplished 
by screening a BAC library by filter hybridisation with the microsatellite markers 

15 ava£able for proximal BTA14q, as well as with human cDNA clones mapping to the 
orthologous chromosome segment on the human RH transcript map: 8q23.3-ter (Riquet 
et aly (1999)). The ends of the isolated BACs were sequenced, sequence tagged sites 
(STS) developed from the corresponding sequences, and mapped onto a bovine x 
hamster whole genome radiation hybrid panel. This STS content mapping approach 

20 lead to the construction of the BAC contig shown in Figure 1. 

DGATl maps to the BULGE9'BULGE30 interval and is a strong positional candidate for 
the QTL. 

A murine gene encoding a protein with Diacylglycerol-o-acyltransferase (DGATl) activity 
25 was identified (Cases et al, (1998)) and shown to completely inhibit lactation when 
knocked out in the mouse (Smith et al., (2000)). This gene was reported in the hximan 
to map to HSABqter (Cases et al, (1998)), ie. in the region orthologous to that 
containing the bovine QTL. Screening the publicly available databases with the 
published murine and human DGATl cDNA sequences allowed identification of (i) a 
30 human BAC clone containing the human DGATl gene (AF205589), and (ii) three bovine 
Expressed Sequence Tags (AW446908; AW446985; AW652329) jointly covering 
approximately two thirds of the bovine gene. Aligning the himian DGATl genomic 
sequences with the human and bovine cDNA sequences allowed the corresponding 
intron-exon boundaries to be identified. Primers were developed to PCR amplify a 
35 portion of the bovine DGATl gene. Screening the BACs composing the BULGE9- 
BULGE30 contig clearly indicated that the bovine DGATl gene was contained in a 
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subset of the BACs allowing us to accurately position the DGATl gene in liie contig of 
Figure 1. 

These results demonstrated that the map position of DGATl coincided with the most 
5 likely position of the chromosome 14 QTL as determined by linkage and linkage 
disequilibrium analyses. Knowing that the QTL primarily affects fat content, knowing 
the enzymatic activity of DGATl and the effect of a DGATl knock-out on lactation, this 
gene was considered to be a very strong positional candidate for the corresponding QTL. 

10 Organisation of the bovine DGATl gene 

The organisation of the bovine DGATl gene was determined by sequence analysis of one 
of the DGATl containing BACs. Primers were designed based on the available bovine, 
murine and human cDNA sequences which were either used for direct sequencing of 
the BAG clone or to generate PGR products corresponding to different parts of the 

15 bovine DGATl gene from this BAG which were then subjected to cycle-sequencing. All 
available sequences were then merged using the Phred / Phrap software (Ewing et aL, 
(1998); Ewing & Green, (1998); Gordon et al, (1998)) to yield the consensus sequence 
shown in Figures 2a and b. 

20 RT-PCR, 5' and 3' RACE experiments were performed on mRNA isolated from bovine 
mammary gland and the obtained PGR products subjected to cycle sequencing. 
Comparison of the genomic and cDNA sequences showed that the bovine DGATl gene 
spans 8.6 Kb and comprises 17 exons measuring 121.8 bp on average (range: 42 - 436 
bp) and allowed intron-exon boundaries to be identified (Figures 2£i, 2b and 3). The 

25 cDNA sequence is also set out in SEQ ID NO; 4. While the first two introns are 
respectively 3.6 and 1.9 Kb long, the remaining 14 introns are only 92.4 bp long on 
average (range: 70-215 bp). All introns conform to the GT-AG rule and are strictly 
conserved between human and bovine. The bovine DGATl gene is transcribed in a 
mRNA comprising >31 bp of 5' UTR sequence (Figure 2a), 1470 bp coding for a protein 

30 of 489 amino-acids, and 275 bp of 3' UTR sequence including a canonical AATAAA 
polyadenylation signal. The human and bovine DGATl nucleotide (coding) and protein 
sequences are respectively 89.5% and 92.5% identical (Figures 2a, 2b, 4a and 4b). In 
addition, an alternative splicing variant is predicted in the bovine for exon VIII (Figure 
2b). The corresponding bovine cDNAs are predicted to encode proteins comprising 

35 respectively 489 and 467 (alternative splicing variant) amino-acid residues (Figures 4a 
and 4b). 
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The predicted "Q" and "q" QTL alleles differ by a non conservative lysine to alanine aminch 
add substitution in the DGATl gene. 

Assiiming that DGATl is indeed the QTL, it is predicted that the identified "<7 and "q" 
QTL alleles will correspond to functionally distinct DGATl alleles, ie. will differ at one 
5 or more mutations causing these alleles to be fiinctionaily different. To test this 
hypothesis, the structure of the DGATl gene in individuals predicted to be of different 
QTL genotypes: "QQ*, "Q(f and "q<f was examined. More specifically, the DGATl gene 
from: 

10 (i) two sires with "HO^/hJ" genotype as well as two of their ''HP-o/hQ-d" 

offspring, two of their "hfl/h'f offspring and one "H<?-^/h9" offspring, and 

(H) one "W-^/M* sire with one of its "H^i^/HQ-wz" offspring 

15 was analysed wherein H9-^ corresponds to the Dutch Q haplotype and 

corresponds to the New Zealand Q haplotype, and primer pairs were designed that 
allowed for the amplification from genomic DNA of (i) the coding portion of exon I, (ii) 
exon II, and (iii) the chromosome regions spaiming exons III to XVII. The corresponding 
PGR products from the selected individuals were cycle-sequenced and the resulting 

20 sequences examined with the Polyphred software. 

Additional sequencing analysis, as described above, on DNA from a range of breeds 
revealed additional polymorphisms included in Table 1 (see Methods section for breeds). 
Four such polymorphisms were investigated further: 

25 

(i) K232A: a substitution of a ApA by a GpC dinucleotide in exon VIII (respectively 
positions 694 and 695 counting from the start codon in the cDNA). The substitution of 
these two adjacent nucleotides results in a non conservative lysine (hydrophylic basic 
amino acid) to alanine (hydrophobic amino acid) substitution in the DGATl protein. The 

30 lysine residue affected by this polymorphism is conserved in the human and murine 
DGATl sequences. Together with the resulting change in the electrical charge of the 
protein, this strongly suggests that this amino-acid substitution is likely to result in a 
functional difference between the two corresponding alleles and to be at least partly 
responsible for the observed QTL effect. 

35 

(ii) Nt9B4-¥8(B<ise 7438 A-GU A A to G substitution in intron 12, eight base pairs 
downstream of exon XIL Following standard nomenclature, this polymorphism will be 
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referred to as Nt984+8(A'G). This poljrmorphism cannot be predicted as such to modify 
the functionality of the corresponding alleles although an effect on the splicing 
mechanism cannot be excluded given its proximity to the intron-exon boundary. 

5 (Hi) m984+26tB€ise 7456 C-T): A C to t substitution in intron 12, 26 base pairs 
downstream of exon Xn. Following standard nomenclature, this polymorphism will be 
referred to as Nt984+26(Base 7456 C-Tj, Again, this polymorphism cannot be predicted 
as such to modify the functionality of the corresponding alleles although an effect on 
the splicing mechanism cannot be excluded given its proximity to the intron-exon 
10 boundary. 

(iv) Ntl470+8SaScLse 8402 C-T): A C to T substitution in the 3' UTR. Following 
standard nomenclature, this polymorphism will be referred to as Ntl470+85(Base 8402 
C-7). Again, this polymorphism cannot be predicted as such to modify the functionality 
15 of the corresponding alleles although an effect on polyadenylation or mRNA stability 
cannot be excluded. ^ 

Conclusion 

20 These four polymorphisms were shown to assort into three distinct SNP haplotypes 
referred to as sH^^, sH9'^ and shi because in the sequenced samples they coincided 
respectively with microsatellite haplotypes /iH9-^, /jH^^ and jufv. The base pair 
compositions of these three SNP haplotypes are shown in Figure 3. 

25 Because the sE9-^ and shB marker haplotypes share the G residue at the DGATl 
Nt984-^8(B(Xse 7438 A-G) site, the causality of this polymorphism in the determinism of 
the QTL could be excluded. For the three remaining polymorphic sites, however, the 
DGATl haplotypes associated with marker haplotypes sHP-^ and sHO-nz proved identical 
to each other while different from the shi DGATl haplotype. Either of these three 

30 polymorphisms could therefore be responsible for the observed QTL effect. The 
Nt984-i'26(Base 7456 C-7) and Ntl470-^85(Base 8402 C-Tj polymorphisms are a priori 
more likely to be neutral with respect to DGATl activity because of their respective 
location in an intron and the 3' UTR and likewise the other non coding or neutral 
polymorphism shown in Table 1. A direct effect of the K232A mutation on DGATl 

35 activity, however, is very plausible. Indeed, the corresponding lysine residue is 
conserved amongst all examined mammals (i.e. human, mouse, rat, pig, sheep, bison) 
demonstrating its functional importance (Figure 5). The evolutionary conservation of 
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this lysine residue also demonstrates that the K residue characterizing the sH^^ and 
sJiO-w? marker haplotypes is more than likely the ancestral state and that it is the A 
residue characterizing the sh^ haplotypes that corresponds to a more recently evolved 
state. 

5 

2* Genot3rpe Testing and Analysis I 

This summarises the genotjrpe testing and subsequent analysis of Holstein-Friesian 
animals sourced from New Zealand and Holland which were tested for the presence of 
10 the K232A pol3nmorphism. Reference to allele "Q" corresponds to the K residue and 
allele "q" to the A residue (as shown in Figure 3 and Table 1). 

An oligonucleotide ligation assay (OLA) was developed as described in the method 
section below that allows for efficient genotyping of the four DGATl polymorphisms 

15 simultaneously. This OLA- test was used to genotype a previously described (Famir et. 
al., 2000)- "grand-daughter design" (i.e. series of paternal half-brother pedigrees) 
comprising 1,818 Dutch Holstein-Friesian sires as well as a "daughter design" (i.e. 
series of paternal half-sister pedigrees) comprising 529 New Zealand Holstein-Friesian 
cows selected according to phenotype as described below. The marker linkage phase for 

20 each individual was determined as described below. 

Fig. 6 summarizes the frequency distribution of DGATl haplotypes encountered in the 
Dutch and New Zealand populations respectively. Four distinct SNP haplotypes were 
identified. Three of these correspond to the sHO-o^ sHQ-^ and sh^ that were previously 

25 identified by sequencing, and jointly account for 99% and 98% of the chromosomes in 
the Dutch and New-Zealand populations respectively. A fourth minor haplotype was 
found accounting for the remaining 1% and 2% of the chromosomes. As this haplotype 
codes for a K residue at position 232 it was assumed to correspond to a fat increasing 
"<7 allele and was therefore referred to as sHO-^ (Fig. 3). The observation that the K 

30 residue is found on three distinct DGATl haplot5rpes while the A residue is found on a 
unique DGATl haplotype is in agreement with K being the more ancient state. 

The sHQ-^ and sH9-^ SNP haplotypes (coding for a if residue at position 232) appear to 
be in strong linkage disequilibrium (LD) with the flanking microsatellite markers 
35 BULGE09 and BULGEll, as they are in essence associated with unique microsatellite 
haplotypes corresponding respectively to the' previously defined /J^^ and juHP-^ 
haplotypes (Fig. 6C&D). In sharp contrast, the sht haplotype (coding for an A residue at 
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position 232) is nearly evenly distributed across more than ten distinct microsatelUte 
haplotypes (Fig. 6B). 

These observations are in excellent agreement with the results of the combined linkage 
5 and LD analysis (Femier et. al., 2000). These studies indeed predicted (i) that in the 
Dutch population the vast majority (estimates ranging from 81% to 92%) of "Q** allele (= 
Kj would reside on the ^iH^^ microsateilite haplotype, (ii) that in the New Zealand 
population a large fraction (estimates ranging from 36% to 51%) of alleles would 
reside on haplotype /2H<>^^ (we now see that the remainder correspond, mainly to the 
10 fiH^^ microsateilite haplotype) and (iii) that in both populations the "g" alleles (= A) 
would correspond to mxaltiple marker haplotypes, corresponding to hn . 

Figure 7 illustrates the gain in LD signal that could be obtained in the Dutch Holstein- 
Friesian grand-daughter design when adding the DGATl polymorphisms to the 

15 previously available markers for proximal BTA14q and performing a joint linkage and 
LD multipoint analysis (Femier et. al., 2000) using the sires ''daughter yield deviations'' 
(DYD (Van Raden and Wiggans, 1991) corresponding to half breeding values) for milk 
fat percentage as phenotype. It can be seen that the lod score attributable to LD 
essentially doubles (from 3.7 to 7.8), and maximizes exactly at the position of the 

20 DGATl gene. This result strongly supports the causal involvement of the DGATl gene in 
the QTL effect. The corresponding ML estimates of the "Q* to "<f allele substitution 
effect (a/2) (as defined in Falconer and Mackay, 1996), residual standard deviation (a), 
population frequency of the "Q'' allele (/<?), number of generations to coalescence (g) and 
heterogeneity parameter (p) were respectively 0.11% (a/ 2), 0.06% (a), 0.20 [foj, 5 (g) and 

25 0.84 (p). 

Using the same Dutch Holstein-Friesian population, the additive effect of the DGATl 
K232A polymorphism on milk yield and composition was examined. The . sons DYDs for 
milk yield (kgs), protein yield (kgs), fat yield (kgs), protein percentage and fat 

30 percentage, were analysed using a mixed model including (i) a regression on the 
number of alleles in the genotype (0, 1 or 2), and (ii) a random polygenic component 
estimated using an individual animal model and accounting for all known pedigree 
relationships. Table 2 below, reports the obtained results. It can be seen that the 
K232A mutation has an extremely significant effect on the five analysed dairy traits. 

35 The proportion of the trait variance explained by this polymorphism in this population 
ranges from 8% (protein yield) to 51% (fat percentage), corresponding to between 10% 
(protein yield) and 64% (fat percentage) of the genetic variance (= QTL + polygenic). 
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Note that the proportion of the variance explained by the full model (l-r^error) is of the 
order of 70% for the yield traits and 80% for the percentage traits, which is in 
agreement with the known reliabilities of the corresponding DYDs (Van Raden and 
Wiggans, 1991). An interesting feature of this QTL effect is that the ""(f to "Q' 
5 substitution increases fat yield, while decreasing milk and protein yield, despite the 
strong overall positive correlation characterizing the three yield traits. 

Table 2: 

Effect of the DGATl K232A mutation on sire's daughter yield deviations (DYDs) 
iO for milk yield and composition. 



Trait 


a/2 


r^drL 


p-value QTL 


r^polygeiiic 




MiIk^ld(Kgs) 


-158 Kgs 


0.18 


5.00E-35 


0.49 


0.32 


Fat yield (Kgs) 


5.23 Kgs 


0.15 


1.57E-29 


0.55 


0.30 


Protein yield (Kgs) 


-2.82 Kgs 


0.08 


L70B-15 


0.65 


0.26 


Fat% 


0.17% 


0.51 


4.33E-122 


0.29 


0.19 


Protein % 


0.04% 


0.14 


5.05E-28 


0.66 


0.20 



(i) a/2: QTL allele substitution effect on DYD (half breeding value), corresponding in the mixed model to the 
regression coefficient on the number of K alleles in the DGATl K232A genotype, and to a/2, where a is defined 
according to ref. Falconer and Madcay, 1996. (ii) r'^ru proportion of the trait variance explained by the DGATl 
15 K232A polymorphism, (iii) p-value statistical significance of the DGATl K232A effect (iv) r^poiyeTOic: proportion 
of the trait variance explained by the random, polygenic effect in the mixed model, (v) r'emt: proportion of the trait 
variance unexplained by the model 

The two previous analyses examined the effect of the DGATl polymorphism on 
20 estimated breeding values. By definition, this phenotype will only account for the 
additive component of the DGATl effect, and justifies the use of a regression on the 
number of K alleles in the mixed model. To evaluate the dominance relationship 
between the DGATl alleles, the effect of the K232A genotype on the lactation values 
(first yield deviations (Van Raden and Wiggans, 1991)) of the cows composing the New 
25 Zealand daughter design were analysed. This was achieved by using a mixed model 
including (i) a fixed effect corresponding to the K232A genotype, and (ii) a random 
polygenic component accounting for all known pedigree relationships ("animal modeF). 
Very significant effects of K232A genotype on all examined yield and composition traits 
were found in this population as well (Table 3, below), accounting for between 1% 
30 (protein yield) and 31% (fat percentage) of the trait variance. The observed dominance 
deviations, d, corresponding to the difference between the genotypic value of the KA 
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genotype and the midpoint between the AA and KK genotypic values (Falconer and 
Mackey, 1996) are shown in Table 3 below. Genotypic values of the heterozygous 
genotype are systematically in between alternate homozygotes. None of the d-values 
proved to be significantly different from zero, indicating an absence of dominance. 
5 Average if to A QTL allele substitution effects, a (Falconer and Mackey, 1996), were 
computed from the estimates of a- and d-values, as well as the population frequencies 
of the K and A alleles (Table 3). The predicted substitution effects are in good 
agreement with those computed from the grand-daughter design: the K allele increases 
fat yield, fat % and protein %, while decreasing milk and protein jdeld. The absolute 

10 values of a estimated from the grand-daughter and daughter design are in perfect 
agreement for fat and protein %, while for the yield traits estimates are larger in the 
grand-daughter design when compared to the daughter design. The exact reasons for 
this are being explored. It could be due to the fact that the sire population in the 
grand-daughter design is not representative of the cow population in general, or to 

15 intrinsic differences between the Dutch and New-Zealand popiilations and/or 
environment. 

Table 3: 

Effect of the DGATl K232A mutation on cows' lactation values 
20 for milk yield and composition. 



Trait 


a 


d 


a 


r2<7a 


P-Val QTL 


r^pofygenk 




Milk yield (Kgs) 


-144 Kgs 


-42 Kgs 


-161 Kgs 


0.03 


1.05E-8 


0.54 


0.43 


Fat yield (Kgs) 


7.82 Kgs 


-0.89 
Kgs 


7.46 Kgs 


0.09 


1.77E-20 


0.46 


0.45 


Protein yield (Kgs) 


-2.34 
Kgs 


-0.76 
Kgs 


-2.64 
Kgs 


0.01 


4.35E-2 


0.37 


0.42 


Fat% 


0.41 % 


0.03 % 


0.42% 


0.31 


2.S£-108 


0.49 


0.20 


Protein % 


0.08% 


0.03 % 


0.08% 


0.04 


1.60E-20 


0.72 


0.24 



(i) a: half the difference between the genotypic values of the KK and AA genotypes (Falconer and Mackey, 1996). (ii) 
d; dominance deviation (Falconer and Mackey, 1996): deviation of the KA genotypic value from the midpoint 
between the AA and KK genotypic values; none of these proved to be significantly different from zero, (iii) a: 
25 average if to A substitution effect, computed as "a + d(q-p)'' (Falconer and Mackey, 1996), where q is the allelic 
frequency of K (=0.7) and pQiA (=0.3) (iv) r^QTL: proportion of the trait variance explained by the DGATl K232A 
polymorphism, (v) p-val qtl: statistical significance of the DGATl K232A effect, (vi) r'ppiycMie: proportion of the trait 
variance explained by the random, polygenic effect in the mixed model, (vii) r^m^: proportion of the trait variance 
unexplained by the model. 

30 
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Pedigree material and phenotypes. The pedigree material "used for the association 
studies comprised a *grand-daughter design" (Weller et. al., 1990) counting 1,818 
Holstein-Friesian bulls sampled in the Netherlands, as well as a "daughter-design" 
(Weller et. al., 1990) counting 529 Holstein-Friesian cows sampled in New Zealand. The 
5 phenotypes of the sires were "daughter yield deviations"(DYD: unregressed weighted 
averages of the daughter's lactation performances adjusted for f systematic 
environmental effects and breeding values of the daughter's dams and expressed as 
deviations from the population mean (Van Raden and Wiggans, 1991)) obtained directly 
from CR- Delta (Amhem - The Netherlands). The phenotypes of the cows were "lactation 
10 values" (first lactation yield deviations (YD), i.e. weighted average lactation 
performances expressed as deviations from the population mean, adjusted for 
management group, permanent environmental effects and herd-sire interaction effects 
(Van Raden and Wiggans, 1991)) obtained directly from Livestock Improvement 
Corporation (Hamilton - New Zealand). 

15 

Combined linkage and linkage disequilibrium analysis and association studies. The 
maximum likelihood procedure for combined linkage and linkage disequilibrium 
analysis is described in detail in Farnir, 2000. The association study in the grand- 
daughter design was performed using the following model: 

20 yi =// + i&^+a, 

where yi is the DYD of son 4 is the overall population mean, p is a fixed regression 
coefficient estimating the A to K allele substitution effect, x, is an indicator variable 
corresponding to the number of K alleles in the K232A genotype, Oi is a random 
polygenic component accounting for all known pedigree relationships ("animal model" 

25 Lynch and Walsh, 1997) and e,- is a random residual. The association study in the^ 
daughter design was performed using the model: 

yi + 

where yi is the lactation value of cow i, gi is a fixed effect corresponding to the DGATl 
genotype (KK, KA, or AA), cu is a random polygenic component accounting for aH known 
30 pedigree relationships ("animal . model" Lynch and Walsh, 1997) and ei is a random 
residual. In both instances, maximum likelihood solutions for p, gi ,0i, ei, o^a, cr^e were 
obtained using the MTDFREML program (Boldman et al, 1997). 
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3. Genotype Testing and Analysis K 

This summarises the genotype testing and subsequent analysis of Holstein-Friesian, 
Jersey and Ayrshire animals in a separate population from those presented in genotype 
testing and analysis I, above. 

Progeny tested sires 

Each year Livestock Improvement Corporation (New Zealand) progeny test some 200- 
300 bulls per year. This entails the bulls being genetically evaluated on the basis of 50- 
85 daughters per sire. The sires are evaluated for milk fat, milk protein, milk volume 
and 20 non-production traits. Semen has been retained from all progeny tested sires 
since the early 1970s. DNA was extracted from the semen and genotyped for the K232A 
DGATl polymorphism using the 7900 Taqman system (see Methods section below). 

Statistical analysis was undertaken on this dataset using Restricted Maximum 
Likelihood (REML) and the average information algorithm (Johnson and Thompson, 
1995). The linear model included the fixed effects of DGATl (3 classes; 0, 1 and 2 
copies of the Q allele i.e. the K residue) and a covariate corresponding to the proportion 
of overseas genetics. The random effect was animal with a relationship matrix based on 
all known relationships. Daughter yield deviations (DYDs), weighted averages of a sire's 
daughter's lactation performances expressed as deviations from the population mean 
(van Raden and Wiggans 1991) were used as the phenotypic measurement. The 
phenotypes were weighted by a weighting factor based on the variance of the DYD for a 
son being: 



Var DYD^ 



where Var DYD is the variance of son's DYD; n is the number of daughters contributing 
to the DYD; h^ is the heritability, which was taken as 0,35 for yield traits. 
The dataset was analysed separately for the 3 major breeds; Holstein-Friesian, Jersey 
and Ayrshire. 
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Seventeen hundred and thirteen Holstein-Friesian sires were included in the analysis. 
The effect of the DGATl polymorphism was extremely significant for the three milk 
production traits (Table 4). With each additional Q allele the level of milk fat production 
increases by approximately 6 kg per lactation, milk protein production decreases by 
5 approximately 2.5 kg per lactation and milk volume decreases by approximately 125 
litres per lactation. 

Table 4: 

Effect of the DGATl polymorphism on milk production in the Holstein-Friesian 
10 bull population (kilograms per lactation). 





Fat 


Protein 


Milk 


qq 


0 


0 


0 


Qq 


6.86 


-2.13 


-128 


QQ 


11.83 


-4.80 


-266 


st. error 


0.87 


0.68 


24 



The effects for the Jersey and Ayrshire breeds were less significant than those of the 
Holstein-Friesian breed but were consistent in direction of effects. 

15 

Daughters for milk components 

Data collection was integrated with LIC's herd testing service using a sample of 102 
herds involved in LIC's Sire Proving Scheme (BPS) in 1995. In addition to milk volume 
from herd testing, the concentrations of fat, crude protein (total nitrogen), casein, whey 

20 and lactose were determined. The data was collected from over 3,000 cows bom in 1996 
and iirst calving in the 1998 spring season, these being predominantly the daughters of 
approximately 220 SPS bulls. The milk characteristics were measured at three herd 
tests on each cow, with each herd having a herd test in each of the Sept/ Oct, Nov/ Dec 
and Jan/Feb periods. The Milkoscan FT120, which employs Fourier transform infrared 

25 spectrophotometry with enhanced milk calibrations (Foss Electric Application Note Nos. 
95, P/N 492280 and 102, P/N 578377), was used to determine the milk component 
concentrations. 

Nine hundred and twelve daughters were genotyped for the DGATl polymorphism using 
30 the OLA system. Analysis was undertaken using SAS (Statistics, Version 5, 1985) 
fitting a general linear model. The model included sire and maternal grandsire as fixed 
effects, DGATl polymorphism (3 classes; 0, 1 and 2 copies of the Q allele), covariates 
including 16ths of Holstein-Friesian, Jersey, Ayrshire and other, proportion of overseas 
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genetics within the Holstein-Friesian, Jersey and Ayrshire breeds. Yield deviations that 
were pre-adjusted for herd, stage of lactation among other fixed effects were used 
(Johnson et al 2000). 

5 The DGATl polymorphism is statistically significant for Lactose, casein, beta-casein 
and whey yield and also for casein and beta-casein percent as outlined in Table 5. 

Table 5: 

EfTect of the DGATl polymorphism on milk componeiits. 

10 



Trait 


qq 


Qq 


QQ 


p-value 


Lactose yield* 


48 


23 


0 


<0.0001 


Casein yield* 


11.0 


5.8 


0 


0.01 


Casein % 


-0.13 


-0.06 


0 


<0.0001 


Whey yield* 


6.86 


2.31 


0 


<0.0001 


P-casein yield* 


3.98 


2.19 


0 


0.05 


p -casein % 


-0.43 


• -0.23 


0 


0.0001 



* Units = g/day for lactose, casein and whey yield and g/litre for ^-casein yield 

Daughters for solid fat content 

15 Six hundred and ninety- two daughters were phenolyped for solid fat content. Solid fat 
content of the milkfat is a characteristic which has a major influence on the 
functionality of milkfat products, and in particular has a significant effect on the 
hardness of butter (MacGibbon & McLennan, 1987). The solid fat content at 10"C (SFC 
10) was used for comparison of the properties of the milkfat as it relates well to the 

20 sectility hardness measurement of butter, a major functional property. Thus the 
performance of milkfat products may be predicted from the characteristics of the milk 
produced. The solid fat content (SFC) of the extracted fat was determined by pulsed 
nuclear magnetic resonance (NMR) and expressed as percentage solid fat (MacGibbon 85 
McLennan, 1987). As the milkfat was melted to remove any thermal history, prior to 

25 reciystallization under standard conditions, the SFC simply reflects the chemical 
composition of the milkfat. 

. The 692 daughters were a subset of the 912 daughters that were phenotyped and 
genotyped for the results presented in Table 2. The solid fat content measures were 
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collected over 2 lactations. Breeding values were calculated using an animal model 
similar to that of Johnson et al 2000. 

The same statistical model was fitted for solid fat content as was for the milk 
5 component analysis. The DGATl polymorphism has a statistically signiiicant effect (p- 
value <0.0001) on solid fat content, increasing it by approximately by 1% for each 
addition of the Q allele. 

This effect was further confirmed in 50 daughters (predominantly Holstein-Friesian) 
10 that were farmed at one location and measured for SFC on the same day.. The 
estimated effect for of the DGATl polymorphism on SFC was to increase it by 
approximately 2% per addition of each Q allele. This finding was significant at the five 
percent threshold level. 

15 The genetic standard deviation for SFC is 2.25 (D Johnson personal communication) 
and thus the effect of DGATl is approximately 0.5 of a genetic standard deviation. 

4. Relative Transcript Levels of the Splice Variant 

20 Real time PCR experiments were conducted using reverse transcribed mRNA isolated 
from lactating bovine mammary gland(s) (see experimental methods). These 
experiments revealed that the alternatively spliced transcript as shown on Figure 2b, 
was approximately 100 fold less abundant than the full length transcript. 
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METHODS SECTION 

In order to identify other polymorphisms within the bovine DGATl gene, DNA was 
5 isolated from sperm, PGR amplified and then using primers designed from the sequence 
shown in Figures 2a and 2b and/or the cDNA sequence (SEQ ID NO: 4) direct 
sequenced on an ABI 3100. The breeds examined were: 

Ayrshire, Angler, Belgian Blue, Blond D'Aquitaine, Brown Swiss, Charolais, Red Devon, 
10 Devon, Dexter, Friesian, Guernsey, Belted Galloway, Gelbvieh, Hereford, Jersey, 
Limousin, Longhom, Maine Anjou, MRI (Meuse-rhine-yssel), Murray Grey, Piedmontese, 
Rpmangola, Sahiwal, Santa Gertrudis, Scottish Highland, Shorthorn, South Devon, 
Sussex, Swedish Red, Simmental, Wagyii, Welsh Black, Angus, and. Zebu. 

15 Ail the polymorphisms discovered are listed in Table 1, above. 

The majority of the primers are also Hsted in Figure 2b or contained in the cDNA 
sequence (SEQ ID NO: 4). 

20 Experimental method for the OLA analysis of four SNP's in DGATl 

PCR amplification of the regions containing the polymorphisms 

Protocol for the PCR amplification of exon VIII, intron XII and 3' UTR, the regions 
containing the four polymorphisms that were initially described in the DGATl gene. 

25 



Component 


For 1 sample 


Final concentration 


HotStar Qiagen Buffer (10 x) 


1.5 Ml 


0.7 mM 


Primer 


17F at IOOmM 


0.07^1 


0.7mM 




ISRat 10>M 


0.07 Ml 


0.5 mM 


Primer 


6F at lOOjiM 


O.OSmI 


0.5 mM 




AW 446985dnl at lOOpM 


O.OSmI 


0.5 mM 


Primer 


InsUpl 


O.OSjil 


0.5 MM 




14R2 


O.OSiil 


10% 


DMSO 




^ iMl 


300 MM 


dNTP lOmM 


0.3 Ml 


0.1 U/m1 


HotStarQiagen Taq (CatNr 203205:5U/|il) 


0.2 Ml 


H20 




1.66 Ml 




DNA (5nK/ui) 


5m1 




Total 


lOMl 





Primer sequences are given in the following table as well as the genomic region targeted 
by them. 
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SNP targeted 


Primer name 


Primer sequence 


5 * base position 


ExonVIIISNP(DG 1) 


17F 


CCTGAGCTTGCCTCTCCCACAGT 


6579 . 




18R 


CCAGGAGTCGCCGCAGCAGGAAG 


7058 


ExcMiXUSNPs 


6F 


CCGGCCATCCAGAACTCCATGAAG 


7280 


(DG 2 and DG3) 


AW446985 dnl 


TAGAACTCGCGGTCTCCAAAC 


7605 




InsUpl 


TGGCTGTCACTCATC\TCGGGCA 


8222 


3VTR SNP (DG4) 


14R2 


TTGCACAGCACTTTATTGACACA 


8566 



PGR amplification was performed on MJ PTC 100 or PTCT200 cyclers using the following 
steps: 



step . 


Temperature 


Time 


Comment 


1' Activation of the enzyme 


94 'C 


12 minutes 


One times 


2' Denaturation 


92 °C 


1 minute 


Repeat step 2 to 4, 35 


3' Hybridisation 


60*C 


1 minute 30 seconds 


times 


4* Elongation 


72'C 


1 minute 30 seconds 




5** InactivatLon of the enzyme 


99-C 


45 minutes 





Oligonucleotide Ligation Assay (OLA) 

The oligonucleotides used in the OLA multiplex reaction are given in the table below. 
The detection of each mutation relies on the use of two fluorescent-labelled 
oligonucleotide (SNPx_FAM and SNPx_HEX) and one common 3' and 5* phosphorylated, 
10 non-labelled oligonucleotide (SNPx_2P) 



Locus 


Oligo 


Sequence 


5' base 
position 


Number of spacer 
phosphoramidites 


Size of the 

ligation 

product* 


DGl 


SNP1_FAM 
SNPl.HEX 
SNP1_2P 


AGO TTT GGC AGG TAA GGC 
AGC TTT GGC AGG TAA GAA 
GGC CAA CGG GGG AG 


6813 
6813 
6831 


0 


32 


DG2 


SNP2_FAM 
SNP2_HEX 
SNP2_2P 


GCT GGC GGT GAG TGA 

GCT GGC GGT GAG TOG 
CCT GCT GGG TGG GGA 


7424 

7424 
7439 


3 


39 


DG3 


SNP3_FAM . 

SNP3_HEX 

SNP3_P 


GCT GGG TGG GGA CGC 
GCT GGG TGG GGA CGT 

GTG GGG GCG GGT GG 


7442 
7442 

7457 


0 


29 


DG4 


SNP4_FAM 
SNP4_HEX 
SNP4 2P 


TGC CCC AAC CTG GGT 
TGC CCC AAC CTG GGC 
GCA GCA GGA GGA GGC 


8388 
8388 
8403 


2 


36 



* The size of the ligation products is the sum of the number of nucleotides of the two ligated oligonucleotides plus 3 
bases equivalents per spacer phosphoramidites molecule, present at the 5' end of the common oligonucleotide. 

For each SNP a mixture of the three oligonucleotides was prepared first, following the 
dilution guidelines in the table below. 
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SNP mixture 


Oligonucleotide to mix 


Quantity 


Final concentratioii 


DGl (oligo. mixture) 


SNPl^FAM 10 |iM 


10 Ml 


1 mM 




SNP1_HEX 10 jiM 


20m1 


2mM 




SNP1_2P 10 fiM 


20m1 


2 mM 




H2O 


Crt 111 




DG2 (oligo. mixture) 


SNP2_FAM 10 uM 


10 Ml 


1 mM 




SNP2,HEX 10 uM 


20m1 


2 mM 




SNP2_2P 10 nM 


20m1 


2mM 




H2O 


50 Ml 




DG3 (oligo. mixture) 


SNP3_FAM 10 pM 


10 Ml 


1 mM 




SNP3_HEX 10 mM 


20m1 


2 mM 




SNP3_2P 10 pM 


20m1 


2mM 




H2O 


50 Ml 




DG4 (oligo. mixture) 


SNP4_FAM 10 tiM 


10 Ml 


1 m 




SNP4_HEX 10 m 


30 Ml 


3 mM ' 




SNP4_2P 10 |iM 


20 Ml 


2mM 




H2O 


40 Ml 





The ligation reaction for one sample was performed as follow: 



Component 


Quantity persainple 


DGl oligonucleotide mixture* (35, 70 and 70 nM) 
DG2 oligonucleotide mixture* (12.5, 25 and 25 nM) 

DG3 oligonucleotide mixture* (12.5, 25 and 25 nM) 

DG4 oligonucleotide mixture* (12.5, 37.5 and 25 nM ) 
DMSO 

Incubation buffer of the Tsc DNA ligase (Roche, Cat Nr 1 939 807 or 
1 939 815) 
Tsc DNA ligase 

H2O 

Multiplex PGR (see above) 


0.7 Ml 
0.25 Ml 
0.25 Ml 
0.25 Ml 
2m1 
2m1 

iMl 
8.55 Ml 
5m1 


Total 


20 Ml 



5 

* The final concentration of the oligonucleotides in the Hgation reaction is given between parenthesis (SNP^^FAM, 
SNPxlHEX and SNPx-2P respectively) 

The sample was submitted to the following temperature cycling program in a MJ 
10 PTClOO or PTC 200 PGR machine. 



Step 


Temperature 


Time 


Comment 


1* Initial denaturation step 


98 "C 


2 minutes 


One times 


2* Denaturation 


94 


30 seconds 


Repeat step 2 to 3, 30 


3" Hybridisation and hgation 


45'*C 


3 minutes 


times 


5* Inactivation of the enzyme 


99-C 


45 minutes 





Following the LCR, 20 ill of H2O was added to the ligation reaction. To 0.5 ]xl of the 
diluted ligation reaction, either 2 |il of loading buffer was added, or 2m1 loading buffer 
15 containing TAMRA350 internal line size standard- 



The loading buffer was composed as follows: 1 part of blue dextran (50mg/ml)/ EDTA 
(25mM) and 6 parts of formamide 
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The TAMRA350 containing loading buffer was composed as follows: 3 parts TAMRA350 
(Applied Biosystems 401736 ; 8nM), 10 parts of Bine dextran (50 mg/ml)/EDTA (25 
mM) and 60 parts of formamide. 

5 TAMRA containing samples was placed alternately with TAMRA free samples when 
loaded onto the sequencing gel, in order to ease the identification of the lanes on the gel 
image. 

The samples may require further dilution in order to avoid a too intense fluorescent 
10 signal on the sequencer. It is also veiy likely that from one primer batch to another, 
oligonucleotides concentrations will need adjustment. 

The samples were denatured for 5 minutes at 95'C before loading. The samples were 
then loaded onto a 6% denaturing acrylamide gel on sequencer ABI 373 or a 4% gel on 
1 5 sequencer ABI 377 . 

In addition to the OLA assays referred to above, genotyping of the DGATl polymorphism 
was carried out by utilizing two different techniques for detection of PGR products. 

Gel-based Genotyping Assay 
20 Primer sequences 5' to 3', genomic sequence position in brackets: 



25 



DGATl 21 
DGATl 22 
DGATl 23 



GTAGCTTTGGCAGGTAAGAA (6811) 
GGiSGCGAAGAGGAAGTAGTA (6984) 
TGGCCCTGATGGTCTACACC (6613) 



DGATl 24B: GGGCAGCTCCCCCGTTGGCCGC (6850) 



The fmal reaction conditions were IX Gold PGR buffer, 2.5mM MgCh (Applied 
Biosystems), 200mM each dNTP (Roche), 600nM DGATl 21 and DGATl 22, 400nM 
DGATl 23 and DGATl 24B (Invitrogen), 10% dimethylsulphoxide (Sigma), 3ial DNA 
template and 2.5 units AmpliTaq Gold DNA polymerase (Applied Biosystems) in a total 
30 volume of 50^1. 

Cycling conditions were a 94°C initial denaturation for 5 minutes, then 35 cycles of 
denaturation at 94^*0 for 30 seconds, annealing at 56 °C for 30 seconds, extension at 
72''C for 20 seconds followed by one extension cycle of 72°C for 2 minutes. 

35 
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Primer positions around polymorphism (in bold) on genomic sequence from 6587 to 
6986. 

DGAT123 TGGC CCTGATGGTC TACACC 
5 TGCCTCTCCC ACAGTGGGCT CCGTGCTGGC CCTGATGGTC TACACCATCC 

TCTTCCTCAA GCTGTTCTCC TACCGGGACG TCAACCTCTG GTGCCGAGAG 

10 

CGCAGGGCTG GGGCCAAGGC CAAGGCTGGT GAGGGCTGCC TCGGGCTGGG 
GCCACTGGGC TGCCACTTGC CTCGGGACCG GCAGGGGCTC GGCTCACCCC 

15 

DGATl 21 GTAGCTTTGGCAGGTAAGAA 

► 

CGACCCGCCC CCTGCCGCTT GCTCGTAGCT TTGGCAGGTA AGAAGGCCAA 

^ CGCCGGTT 

20 CGGGGGAGCT GCCCAGCGCA CCGTGAGCTA CCCCGACAAC CTGACCTACC 
GCCCCCTCGA CGGG DGATl 24B 

GCGGTGAGGATCCTGCCGGG GGCTGGGGGG ACTGCCCGGC GGCCTGGCCT 

25 

GCTAGCCCCG CCCTCCCTTC CAGATCTCTA CTACTTCCTC TTCGCCCCCA 

< AT GATGAAGGAG AAGCGGGG DGATl 22 

The Q allele has polymorphic sequence AA and is detected by the DGATl 21+22 
30 primers, producing a band of 174bp. The q allele has polymorphic sequence GC and is 
detected by the DGAT123 + 24 primers, producing a band of 238bp. 

The primers DGAT123 and DGAT122 also successfully PCR the DGATl gene producing 
a product of 372bp in all reactions. Therefore, a QQ homozygote would have bands at 
35 372bp and 174bp, a qq homozygote would have bands at 372bp and 238bp and a Qq 
heterozygote would have all 3 bands at 372bp, 238bp and 174bp. 



wo 02/36824 



PCT/NZOl/00245 



39 

18|il of PGR product was separated on a 1.2% agarose TAE gel, stained with ethidium 
bromide and scored independently by two investigators on the basis of the number and 
size of bands present. 

TaqMan Allelic Discrimination Genotyping Assay 
5 Primer sequences 5' to 3', genomic sequence position in brackets: 
DGATlforAD: TTCTCCTACCGGGACGTCAA (665 1) 
ReverseNZ: CCGCGGTAGGTCAGGTTGTC (6890) 

Probe sequences 5' to 3', genomic sequence position in brackets: 
10 ForAA (FAM): CGTTGGCCTTCTTA (6838) 

DGATIADGC (VIC): TTGGCCGCCTTACC (6836) 

Both probes use MGB (minor groove binder) as a non-fluorescent quencher. 

The final reaction conditions are Ix Universal PGR Mastermix (Applied Biosystems), 
15 SOOnM each primer (Invitrogen), 70nM ForAA (FAM) probe, 300nM DGATIADGC (VIC) 
probe (Applied Biosystems) and 2pl of a 1/20 dilution of DNA template in a total volume 
of lO^il, 

Cycling conditions were 50*C for 2 minutes, 95°C initial denaturation for 10 minutes, 
then 37 cycles of denaturation at 94*C for 15 seconds, annealing and extension 60**C 
20 for 1 minute. 

Primer positions around poljonorphism (in bold) on genomic sequence from 6587 to 
6986. 

25 TGCCTCTCCC ACAGTGGGCT CCGTGCTGGC CCTGATGGTC TACACCATCC 

DGATlforAD TTCTCCTACCGGGACGTCAA 
TCTTCCTCAA GCTGTTCTCC TACCGGGACG TCAACCTCTG GTGCCGAGAG 

' 30 

CGCAGGGCTG GGGCCAAGGC CAAGGCTGGT GAGGGCTGCC TCGGGCTGGG 
GCCACTGGGC TGCCACTTGC CTCGGGACCG GCAGGGGCTC GGCTCACCCC 

35 
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ForAA(FAM) A T TCTTCCGGTTGC 

CGACCCGCCC CCTGCCGCTT GCTCGTAGCT TTGGCAGGTA AGAAGGCCAA 

DGATIADGC (VIC) CCAT TCCGCCGGTT 



5 CGGGGGAGCT GCCCAGCGCA CCGTGAGCTA CCCCGACAAC CTGACCTACC 

CTGTTG GACTGGATGG 

GCGGTGAGGA TCCTGCCGGG GGCTGGGGGG ACTGCCCGGC GGCCTGGCCT 
CGCC ReverseNZ 

10 

GCTAGCCCCG CCCTCCCTTC CAGATCTCTA CTACTTCCTC TTCGCCCCCA 

A 240bp product is produced in this reaction. When the Q allele (AA) is present the 
FAM-labelled probe binds and fluoresces at SlSnm. When the q allele (GC) is present 

15 the VIC-labelled probe binds and fluoresces at 554nm. After cycling is complete, the 
plate is scanned on the ABI7900 Sequence Detection System, the fluorescence from 
each well detected, and a scattergraph is drawn. The scattergraph separates out into 3 
clumps with Q homo2ygotes in the upper left hand corner, q homo2ygotes in the lower 
right hand comer and Qq heterozygotes in between. Each clump is circled and the 

20 software automatically determines the genotype for each sample. On each plate there 
are controls with 8 wells each of known Q homozygotes, q homozygotes, Qq 
heterzygotes and no template controls. 

Splice Variant Gene Expression 
25 To determine the relative gene expression of the splice variants created by 
insertion/ deletion of 66bp around the polymorphic site by alternate exon usage, RNA 
was extracted from mammary tissue and reverse transcribed using oligodT primer using 
a first strand cDNA synthesis kit (Invitrogen). Real time PGR to determine relative 
quantities of each variant was then carried out. 

30 

Primer. sequences 5' to 3', genomic sequence position in brackets: 
DGATlforRT66: TCTCCTACCGGGACGTCAAC (6652) 
DGATlrevRT66: GAGATCGCGGTAGGTCAGGTT (6964) 
DGATlforRTless66: GCTGCTTTGGCAGATCTCTACTACTT (6711) 
35 DGATlrevRTless66: AAGCGCTTTCGGATGCG (7038) 



wo 02/36824 



PCT/NZOl/00245 



41 

Probe sequences 5' to 3', genomic sequence position in brackets: 
DGATlwith66 (FAM)r CCGTGAGCTACCC (6857) 

DGATlless66 (VIC): CTTCGCCCCCACCCT (6976) 
5 Both probes use MGB (minor groove binder) as a non-fluorescent quencher. 

Final reaction conditions were IX Universal PGR Mastermix (Applied Biosystems), 60nM 
each primer (Invitrogen), 60nM each probe (Applied Biosystems) and liii of template 
cDNA in a total volume of 10|j1. 
10 Cycling conditions were SO°C for 2 minutes, 95°C initial denaturation for 10 minutes, 
then 37 cycles of denaturation at 94*'C for 15 seconds, annealing cind extension 60°C 
for 1 minute. 

Primer positions around 66bp insertion (in italics) on cDNA sequence. The start of the 
15 cDNA sequence is equivalent to position 6479 on the genomic sequence, with the last 
base of the cDNA equivalent to position 7428 of the genomic sequence, 

CCGTGGCCTTTCTCCTCGAG TCTATCACTC CAGTGGGCTC CGTGCTGGCC 

20 DGATlforRT66 TCTCCT ACCGGGACGT 

CTGATGGTCT ACACCATCCT CTTCCTCAAG CTGTTCTCCT ACCGGGACGT 

CAAC_^ DGATlforRTless66 GCTGCTT 

CAACCTCTGG TGCCGAGAGC GCAGGGCTGG GGCCAAGGCC AAGGCTGCTT 

25 

TGGCAG DGATlwith66(FAM) C CGTGAGCTAC 

TGGCAGGTAA GAAGGCCAAC GGGGGAGCTG CCCAGCGCAC CGTGAGCTAC 

CO ATCTCTAC TACTT 

30 CCCGACAACC TGACCTACCG CGATCTCTAC TACTTCCTCT TCGCCCCCAC 
^ TTGG ACTGGATGGC GCTAGAGDGATlrevRT66 CT TCGCCCCCAC 

CCTGTGCTAC GAGCTCAACT TCCCCCGCTC CCCCCGCATC CGAAAGCGCT 
CCT DGATlless66 (VIC) ^ GCGTAG GCTTTCGCGA 

35 

TCCTGCTGCG GCGACTCCTG GAGATGCTGT TCCTCACCCA GCTCCAGGTG 
A DGATlrevRTless66 
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GGGCTGATCC AGCAGTGGAT GGTCCCGGCC ATCCAGAACT CCATGAAGCC 



5 CTTCAAGGAC ATGGACTACT CCCGCATCGT GGAGCGCCTC CTGAAGCTGG 

This reaction detects the presence of the insertion splice variant by creating a 145bp 
product which binds the FAM probe only. The deletion splice variant is detected by a 
92bp product that binds the VIC probe only. 

10 

The cDNA for each alternate splice variant was cloned into pGemT (Promega). A 
dilution series of the same, known amount, of each variant plasmid DNA was used to 
create a standard curve that established the linearity of the PGR reaction over a range 
of DNA concentrations. The threshold cycle number of the sample variants was 
15 converted back to a DNA amount by linear regression and the amounts of each variant 
present compared. 

The presence of an alternate spice variant raises the possibility of an alternate function 
that is at this stage unknown. 

20 

It will be appreciated that it is not intended to limit the invention to the above examples 
only, many variations, which may readily occur to a person skilled in the art, being 
possible without departing from the scope thereof as defined in the accompanying 
claims. 

25 

INDUSTRIAL APPLICATION 

The present invention is directed to a method of genotyping bovine for improved milk 
production traits. In particular, such traits include increased milk volume and milk 
30 protein content and decreased milkfat content and solid fat content. It is anticipated 
that herds of bovine selected for such a trait will produce milk which will be more easily 
processed and such milk and products made therefrom may provide health benefits to 
consumers, as well as producing an increased milk yield. 
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SEQUENCE LISTING 

<110> REID, SUZANNE J 
FORD, CHRISTINE A 
GEORGES, MICHEL A J 
COPPIETERS, WOOTER H R 
GRISART, BERNARD M J J 
15 SWELL, RUSSELL G 

SPELMAN, RICHARD . J 



10 



20 



<120> MARKER ASSISTED SELECTION OF BOVINE FOR IMPROVED MILK COMPOSITION 



25 <130> P452508 CJE 
<160> 21 

30 

<170> Patentin version 3.1 

35 

<210> 1 
<211> 11771 

40 

<212> DNA 

<213> Bos tauanis 

45 

<220> 

<221> CDS 

50 

<222> {1)..(191) 

<223> Exon 1 CDS, determined by alignment with an amino acid sequence d 
educed from the cDNA 

55 

<220> 

60 <221> misc_signal 
<222> (1)..(4) 



<223> these bases correspond to bases 4 to 7 of the Kozak recognition s 
65 equence. See DGATl'cDNA for the complete recognition sequence. 
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<220> 

5 <221> CDS 

<222> (3809) . . (3896) 

<223> Exon 2, determined by alignment with an amino acid sequence deduc 

10 ed from the cDNA 



<220> 

15 

<221> CDS 

<222> (5840) . . (5880) 

20 <223> Exon 3, determined by alignment with an amino acid sequence deduc 
ed from the cDNA 



25 <220> 

<221> CDS 

<222> (5960) .. (6045) 

30 

<223> Exon 4, determined by alignment with an amino acid sequence deduc 
ed from the cDNA 



35 

<220> 

<221> CDS 

40 <222> (6138) (6190) 

<223> Exon 5, determined by alignment with an amino acid sequence deduc 
ed from the cDNA 

45 

<220> 
<221> CDS 

50 

<222> (6406) (6511) 

<223> Exon 6, determined by alignment with an amino acid sequence deduc 
ed from the cDNA 

55 



<220> 
60 <221> CDS 

<222> (6601) (6714) 



<223> Exon 7, determined by alignment with an amino acid sequence deduc 
65 ed from the cDNA 
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<220> 

5 <221> CDS 

<222> (6B15) . . (6889) 

<223> Exon 8, determined by alignment with an amino acid sequence deduc 

10 ed from the cDNA 

<220> 

<221> variation 

<222> (6824) . . (6889) 

<223> this sequence is deleted from the alternately spliced transcript. 

<220> 
25 

<221> CDS 

<222> (6960).. (7063) 

30 <223> Exon 9, determined by alignment with an amino acid sequence deduc 
ed from the cDNA 

35 <220> 

<221> CDS 

<222> (7154).. (7192) • 

40 

<223> Exon 10, determined fay alignment with an amino acid sequence dedu 
ced from the cDNA 



15 



20 



45 

<220> 

<221> CDS 

50 <222> (7271) .. (7312), 

<223> Exon 11, determined by alignment with an amino acid sequence dedu 
ced from the cDNA 

55 



<220> 



<221> CDS 

60 

<222> (7386) . . (7430) 

<223> Exon 12, determined by alignment with an amino acid sequence ded 
uced from the cDNA 

65 
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<220> 

<221> CDS 
5 <222> (7505) .. (7617) 



10 



15 



20 



35 



50 



55 



65 



<223> Exon 13, determined by alignment with an amino acid sequence dedu 
ced from the cDNA 



<220> 

<221> CDS 

<222> (7705) (7770) 

<223> Exon 14, determined by alignment with an amino acid sequence dedu 
ced from the cDNA 



<220> 

25 <221> CDS 

<222> (7858) .. (7945) 

<223> Exon 15, determined by alignment with an amino acid sequence dedu 

30 ced from the cDNA 



<220> 

<221> CDS 

<222> (8027) (8089) 



40 <223> Exon 16, determined by alignment with an amino acid sequence dedu 
ced from the cDNA 



45 <220> 

<221> CDS 

<222> (8162) . . (8314) 



<223> Exon 17 CDS, determined by alignment with an amino acid sequence 
deduced from the cDNA 



<220> 

<221> misc__feature 

60 <222> (8315) (8317) 

<223> translation stop codon 



<220> 
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poiyA^site 
(8572) (8578) 

<220> . 

10 

<221> variation 
<222> (626).. (626) 
15 <223> thymidine (T) to guanine (G) substitution polymorphism 

<220> 

20 

<221> variation 
<222> (3512) . . (3512) 
25 <223> thymidine (T) to guanine (G) substitution polymorphism 

<220> 

30 

<221> variation 
<222> (4040) (4040) 
35 <223> thymidine (T) to cytosine (G) substitution polymorphism 

variation 
(4963) (4963) 

adenine (A) to guanine (G) substitution polymorphism 
<220> 

50 

<221> variation 
<222> (5003) . . (5003) 
55 <223> guanine (G) to adenine (A) substitution polymorphism 

<220> 

60 

<221> variation 
<222> (5997) . . (5997) 
65 <223> cytosine (C) to thyamine (T) substitution polymorphism 



<221> 
<222> 
5 <223> 



40 



<220> 
<221> 
<222> 
45 <223> 
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<220> 

5 <221> variation 

<222> (6829) (6830) 

<223> adenine (A) -adenine (A) to guanine (G) -cytosine (C) substitution p 

10 olymorphism ^ 

AA corresponds to the Q allele 
GC corresponds to the q allele 

15 

<220> 

<221> variation 

20 <222> (6892) (6892) 

<223> guanine (G) to adenine (A) substitution polymorphism 

25 

<220> 

<221> variation 

30 <222> (7224) (7225) 

<223> guanine (G) -guanine (G) to adenine (A)-cytosine (C) substitution 
polymorphism 
GG-AC 

35 

<220> 

40 <221> variation 

<222> (7438) . . (7438) 

<223> adenine (A) to guanine (G) substitution polymorphism 

45 

<220> 

50 <221> variation 

<222> (7456) , . (7456) 

<223> cytosine (C) to thymidine (T) substitution polymorphism 

55 

<220> 

60 <221> variation 

<222> (7987) . . (7987) 

<223> guanine (G) to adenine (A) substitution polymorphism 

65 
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<220> 

<221> variation 

"5 <222> (8402) (8402) 

<223> ctosine (C) to thymidine (T) substitution polymorphism 

10 

<220> 

<221> misc_feature 

15 <222> (9434) (9434) 

<223> ambiguous nucleotide 

20 

<220> 

<221> misc_feature 

25 <222> (9496) (9496) 

<223> ambiguous nucleotide 

30 

<220> 

<221> misc_feature 

35 <222> (10402) .. (10417) 

<223> ambiguous nucleotides 

40 

<220> . ' 

<221> prime r_bind 

45 <222> (6579) (6601) 

<223> Primer 17F 

CCTGAGCTTGCCTCTCCCACAGT 



50 



55 



<220> 

<221> primer Jbind 
<222> (7036) . . (7058) 



<223> Primer 18R 

CCAGGAGTCGCCGCAGCAGGAAG 
60 reverse primer 



65 



<220> 

<221> primer bind 
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<222> 
<223> 

5 

<220> 
10 <221> 
<222> 
<223> 

15 

20 <220> 
<221> 
<222> 

25 

<223> 

30 

<220> 
<221> 
35 <222> 
<223> 

40 

<220> 
<221> 

45 

<222> 
<223> 

50 

<220> 
55 <221> 
<222> 
<223> 

60 

<220> 

65 

<221> 



54 

(7280) . . (7303) 
Primer 6F 

CCGGCCATCCAGAACTCCATGAAG 



primer_bind 

(7585) . . (7605) ^ 

Primer AW44 6985 dnl 
TAGAACTCGCGGTCTCCAAAC 
reverse primer 



prime r_bind 

(8222) . . (8244) 

Primer InsUpl 
TGGCTGTCACTCATCATCGGGCA 



prime r_bind 
(8566) . . (8589) 
Primer 14R2 

TTGCACAGCACTTTATTGACACA 



primer_bind 

(6813) . . (6830) 

Primer SNP1_FAM 

AGC TTT GGC AGG TAA GGC 



primerjDind 

(6813) . . (6830) 

Primer SNP1_HEX 

AGC TTT GGC AGG TAA GAA 



prime r_bind 
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<222> 
<223> 

5 

<220> 
10 <221> 
<222> 
<223> 

15 

<220> 

20 

<221> 
<222> 
25 <223> 

30 <220> 
<221> 
<222> 

35 

<223> 

40 

<220> 
<221> 
45 <222> 
<223> 

50 

<220> 
<221> 

55 

<222> 
<223> 

60 

<220> 
65 <221> 



55 

(6831) . . (6844) 

Primer SNP1_2P 
GGC CAA CGG GGG AG 



primer__bind 

(7424) . . (7438) 

Primer SNP2_FAM 
GCT GGC GGT GAG TGA 



primer^bind 

(7424) . . (7438) 

Primer SNP2_HEX 
GCT GGC GGT GAG TGG 



prime r_bind 

(7439)... (7453) 

Primer SNP2_2P 

CCT GCT GGG TGG GGA 



primer_bind 

(7442) . . (7456) 

Primer SNP3_FAM 
GCT GGG TGG GGA CGC 



primer_bind 

(7442) .. (7456) 

Primer SNP3_HEX 
GCT GGG TGG GGA CGT 



prime r_bind 
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56 



<222> (7457) .. {7470) 

<223> Primer SNP3_P 

GTG GGG GCG GGT GG 



<220> 

10 <221> primer_bind 

<222> (8388) . . (8402) 

<223> Primer SNP4_FAM 

15 TGC CCC AAC CTG GGT 



20 



<220> 

<221> prime r_bind 
<222> (8388) . . (8402) 



25 <223> Primer SNP4_HEX 

TGC CCC i\AC CTG GGC 



30 <220> 

<221> primer bind 



35 



4b 



50 



55 



65 



<222> (8403) . . (8417) 

<223> Primer SNP4_2P 

GCA GCA GGA GGA GGC 



<220> 

<221> primer_bind 
45 <222> (6811) .. (6830) 



<223> Primer Dgat 21 

GTAGCTTTGGCAGGTAAGAA 



<220> 

<221> primer_bind 
<222> (6965) . . (6984) 



<223> Primer Dgat 22 

GGGGCGAAGAGGAAGTAGTA 
60 reverse primer 



<220> 

<221> primer_bind 
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57 



<222> (6613) (6632) 

<223> Primer Dgat 23 

TGGCCCTGATGGTCTACACC 



<220> 

10 <221> prime rjbind 

<222> (6829) (6850) 

<223> Primer Dgat 24B 

1 5 GGGCAGCTCCCCCGTTGGCCGC 
reverse primer 

20 <220> 

<221> primer_bind 

<222> (6651) (6670) 



25 



30 



35 



45 



<223> Primer DgatforAD 

TTCTCCTACCGGGACGTCAA 



<220> 

<221> primer_bind 
<222> (6871) . . (6890) 



<223> Primer ReverseNZ 

CCGCGGTAGGTCAGGTTGTC 
40 reverse primer 



<220> 

<221> primer_bind 
<222> (6825) . . (6838) 



50 <223> Probe ForAA (FAM) 
CGTTGGCCTTCTTA 



55 

<220> 

<221> prime r__bind 

60 <222> (6823) (6836) 

<223> Probe DgatADGC (VIC) 
TTGGCCGCCTTACC 

65 
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58 



<220> 

<221> primer_bind 

5 <222> (6651) (6670) 

<223> Primer DgatforAD 

TTCTCCTACCGGGACGTCAA 

10 

<220> 

15 <221> primerjDind 

<222> (6878) (6972) 

<223> Primer DgatrevAD 

20 AAGTAGTMAGATCGCGGTAGGTCA 
reverse primer 

25 <220> 

<221> primer_bind 

<222> (6825) (6838) 



30 



35 



40 



45 



50 



<223> Probe ForAA (FAM) 
CGTTGGCCTTCTTA 



<220> 

<221> primer_bind 

<222> (6823) (6836) 

<223> Probe DgatADGC (VIC) 
TTGGCCGCCTTACC 



<220> 

<221> primer^bind 

<222> (6652) . . (6671) 

55 <223> Priiier DgatforRT66 

TCTCCTACCGGGACGTCAAC 

60 

<220> 

<221> primer_bind 

65 <222> (6874) (6964) 

<223> Primer DgatrevRT66 
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59 



25 



30 



35 



40 



GAGATCGCGGTAGGTCAGGTT 
reverse primer 



5 

<220> 

<221> prime r_bind 

10 <222> (6711) .. (6972) 

<223> Primer DgatforRTless66, GCTGCTTTGGCAGATCTCTACTACTT 

This primer was designed to selectively bind and amplify the cDNA 
splice variant. The corresponding binding site in this genomic s 
15 equence comprises bases 6711 to 6715, 6815 to 6823 and 6960 to 69 

72. 



20 <220> 

<221> primer_bind 
<222> (7022) . . (7038) 



<223> Primer DgatrevRTless66 
AAGCGCTTTCGfSATGCG 
reverse primer 



<220> 

<221> prime r_bind 

<222> (6857) . . (6870) 

<223> Probe Dgatwith66 (FAM) 
CCGTGAGCTACCC 



<220> 

45 

<221> priraer_bind 

<222> (6976) . . (6990) 

50 <223> Probe Dgatless66 (VIC) 
CTTCGCCCCCACCCT 



55 

<400> 1 

atg ggc gac cgc ggc ggc gcg ggc 
Met Gly Asp Arg Gly Gly Ala Gly 
1 5 

60 

teg egg cct teg ate cag ggc ggc 
Ser Arg Pro Ser lie Gin Gly Gly 
20 

65 gag gtg egg gat gtg ggc gcc gga 
Glu Val Arg Asp Val Gly Ala Gly 
35 40 



ggc tec egg cgc egg agg acg ggg 48 
Gly Ser Arg Arg Arg Arg Thr Gly 
10 15 

agt ggg cec gcg gca gcg gaa gag 96 
Ser Gly Pro Ala Ala Ala Glu Glu 
25 30 

ggg gac gcg ceg gtc egg gac aca 144 
Gly Asp Ala Pro Val Arg Asp Thr 
45 
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gac aag gac gga gac gta gac gtg ggc age ggc cac tgg gac ctg ag 191 
Asp Lys Asp Gly Asp Val Asp Val Gly Ser Gly His Trp Asp Leu Arg 
50 55 60 

5 





gtagcggtgc 


gcgtgacccc 


taacctttga 


cccctgatac ggggcccctg 


cgacccaacc 


251 




tggtggccca 


ggcctgtcgg 


cggcagctcg 


ggctcgagtc cgagagtctg 


gcgcctggac 


311 


10 


cttggtgcac 


agctgtgccc 


ctcgggcctc 


cacggggaaa cttagcggga 


ggttgggggc 


371 




ggagggtctc 


ctgcccggaa 


cacccaggta 


cgggggccga ggggagggca 


gcggctcaac 


431 


15 


ttctagacgc 


cctccctctg 


ccttcctttg 


gtgggttctg aagctttccc 


agggtgagcc 


4 91 


cactacgcac 


agtgtcctct 


acctggaagg 


agatacaggg gtccttcctg 


agggctatga 


551 




ggggtgcctt 


gtgggttgat 


aaagctcccg 


ggggaggagg gtggaccggc 


ggagaacaga 


611 


20 


ggcaggggca 


gtgctagggg 


atttctcatc 


cctcgcagac cctccagaga 


atggtcttca 


671 




caaaggtccc 


tcatccgtca 


cccggcgatt 


gactggccta ggatcctgct 


tattaccagc 


731 


25 


acaaatggct 


gctctagggt 


caaagtgggt 


cctgtaatgg gaccctcacc 


cctggttggg 


791 


gtacagggga 


ggagttggaa 


gtgcgcacac 


ccacaggtgg gcgccctgct 


tagctgaagg 


851 




actgatggga 


aggagttggg 


ggagcaagct 


gcggctgaaa gggaggatct 


gacccacgtg 


911 


30 


ggcatcagct 


aagtcctgct 


ggctgcctcc 


aggcgttccc tttgccatcc 


tccacgcccc 


971 




tcccccgggg 


cctgaccttc 


atcctggtca 


agggctctca ggggctctgg 


ttttgggatc 


1031 


35 


agctccagag 


ctagaggtta 


tcaaggagga 


agtgggcaac aggtcagtca 


gcaaggattt 


1091 


gctatcttca 


ctgggtgctg 


tggggagggg 


agggacaagg gcagttgggg 


tgcaggcact 


1151 




gtccctgccc 


ttggggggca 


cacagttcac 


ctgagagata agatagccgc 


agccctgaag 


1211 


40 


agtgagagca 


aaggtcaggc 


acagagttca 


ggatgacacc aggggagggt 


ggctctgtga 


1271 




ggggcactgg 


cttcctacag 


gccccaggtg 


gtcctgaggg ggcggctgca 


aaggccagga 


1331 


45 


ggcccacagg 


cccctctgcc 


cactcctggg 


gaactggatt tggggtcact 


ttgtatgagg 


1391 


tgggggcggg 


taccagcttt 


gggccaagct 


gtcaccctgg atgggccatc 


acttgcctgc 


1451 




tctgtatagg 


ccagatggcc 


agaagctgct 


cctgtcctgt tgatggccca 


tcctcgaggt 


1511 


50 


ctggaccctc 


gggaagagga 


gcagttggtg 


gcagggatgg gccaccggag 


accctcctga 


1571 




cctccaggac 


acgcagctgt 


gtgtgcctgt 


ccccaagcca catgccacat 


ggctaggggc 


1631 


55 


ctcctggggc 


agggctgggc 


attggtctgg 


ctactcttgg tatcgcctat 


ggcttccctg 


1591 


cctcccagtc 


atcatcctcc 


cacctctgcc 


tccctgcctg ttcctctctt 


tctcctcagg 


1751 




cccttccgga 


catttcctgc 


tcacctaggt 


ctgggcaggc ggggtcaggt 


gccgggtgtg 


1811 


60 


agctcactcc 


ttccggcagc 


aaggtgtagc 


tatgtgccgg aaggaaggcc 


gctgctgttg 


1871 




cctcgcctct 


gagtgcatcc 


cttccaggtc 


ctccacactc ccctgtgccc 


cgacacctgg 


1931 


65 ' 


tgcgtccttc 


agccattggt 


tcatgtgtcc 


tccaggcaca. gctttctagt 


ccagagcctc 


1991 


taggctgggt 


gcaggaagtg 


ctgaggaagt 


ggcagccggg aggcgagctg 


gcaccctgtc 


2031 



10 
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61 

cctccttgtt ctgtccgtcc ctgcccctgg accgtatggc cccgcatgtg tgatccccac 2111 

ttggggctgt gcctctgggc aagttgggaa gcttggtgag cctcattttc atgtgcccgc 2171 

ctcccagtac tgatgtgcag gttgaatgag gtgccaactg taatgagttg gaatggccct 2231 

gctggctggg tgggactggg gagcaggtgg gggccgctgg ggggcacaga ggcacaccca 2291 

gtgcctcagt cagggagagg gtgacagaga agctctgggt gaggccccac ctccactctg 2351 

gccatggctg ctgccctttg gtccactgca gtgaactgtg ccatggggct ggacctctgt 2411 

ggggattggt gggcagtggg ctttcttccc gcttggggcc tctgacctct gggggcaggg 2471 

15 cgctgcccgg gtgggacagt cggaaggctg gtagagggac ctgaggggtc tgtgtggtgg 2531 

ctgggggcag gcctcaggaa tttgacagca gggatctgga aaagctttaa taacattatt 2591 

tgttgtcagg attgggaaat gctcccctcc cccctccccc tctttcatct tagagactgc 2651 

20 

tgcacatctg gtcagtgtgg tcttcttggt ggcccccaag gtggcagggg tcacactgtt 2711 

atgaaaccgt cccctgggta tgtggtgcag acatgcacat gcagatggtg attggcaggt 2771 

25 tgtagcatga ggtggctttg ggacggttcc agtgacagtg agtgggctgg atctgggggg 2831 

ttctgggcag gtccatcaag cggatacccc cacagactgt cctcttggga tagttgggcc 2891 

tgggagccct gcttgccttg ccaaaaggca ggcgcagagt catgaagaag agggcttggg 2951 

30 

ggctcagagc cccactgtgt gtgcagccca gggtggacct ggaggaggtg cgtgggcagg 3011 

ctgggccggc cggggcgggg ggtggggggg cctggtgtga aagggaccca gggccagact 3071 

35 gtcagcgctg cctggctgag gatgctggca ccctgtcctc cccagccgtc tgtctcctgg 3131 

gtgcagccat ctgagtgctg accccagccg cccctggagg ctggctgttc tcctgtgccc 3191 

tattgctggg gacatgtgtc cacaggaggg aaagggaagc cccggcctct ccccttacaa 3251 

40 

aactggaggc cttgctcaat gccctggatg gcctcctggt ggcagggtgg ttggtgggag 3311 

gtggggctgc tgcttagaac ccgccagcgg gcctgggcct gggctgagct gcacccctcc 3371 

45 acctctgcct ccagctgagg gttggcttcc atctccacca ggcccagcac tgggcacagg 3431 

gctctcagag gcaggctctg aaagtcccct gctggcttct gcagtggact ccaggcgccg 3491 

agcccccagg gggctcgcat tgcgctcacc ctgcgaagcc acgtgaaggc tgggtcctcc 3551 

50 

cctccggaag ggccaaatgc' agggcatggg tggtttgaat ggtggcccct gggctccccg 3611 

gagggaccag ctgctgtgag ggccgccccc tccccacttc cgtcttgcat caccagctcc 3671 

55 tgtggcactc cccacgcccc gtcccccagt gggagcggca ggcccccggt ggctctgccc 3731 

gcggaggggg atgtgtgggc ggcggggtgg ccttgctgcc agatgctctg ccccgagtgt 3791 

ccgtctccgc tctccag g tgt cac cgc ctg cag gat tec ctg ttc agt tct 3842 
60 Cys His Arg Leu Gin Asp Ser Leu Phe Ser Ser 

65 70 75 

gac agt ggc ttc age aac tac cgt ggc ate ctg aat tgg tgt gtg gtg 3890 
Asp Ser Gly Phe Ser Asn Tyr Arg Gly lie Leu Asn Trp Cys Val Val 
65 80 85 90 

atg ctg gtacgtagag tgacaccttg gagcaagggt cctgacggcc ggggggccat 3946 
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Met Leu 

gggctcttct ccaggggtag gtgtctgtac 

5 

ctggcggtgg gggtccctgg agcagccgta 
ccgtccctgc tgaatcccag gagatgcctg 
10 ggtctgcgcc ccagggggca ctggagtctc 
ggtctggtgg ccctgggtgg gggatggctg 
caggtgagtg gtcttggggg tccacgtaga 

15 

gggtgcggca tgcctgggtg aacctggggg 
cccacagctc caggccacac tctgpaggac 
20 aggactggag tcctgtgtcc gccttgctct 
cagcaggcct gcccagcaga aggtggagtg 
ggcagggtga ggcccgactg ggcccagccc 

25 

ttcgcccctc cagctgacgt gtctgcctgc 
accaggggcc cccacgcttc tgtttgtggt 
30 ctgtgtgcca gtccagaccc agtagtccac 
ccaggactgt ggcaaggtgg tcgtgcaggg 
cagcccccag cctcctggcc ctttggtgca 

35 

ggggccgcct gctgcgtcct ctcggcagcc 
gctccaactc ttgcctgtgg tgggctctca 
40 caggaccacg tgtgtcctgg taagctggct 
caggtggccc ctgcatctct gagcctgtca 
tccacccatg tcgcctggag aagcatcact 

45 

ggcctgagcc tggggtgggg gaggtggtgg 
ctggccccag gccactggca agagctcgtt 
50 gcaggagcgg ccgaggcagg cgggggcgtg 
ccagcccctc aggaccggcc tgctctttcc 
cgcctgtgag gatgggcccg gttggcaggg 

55 

gcaggcttgg acttcactgg ggcttggggt 
tggtgggacg gacggccgct gggcagcagg 
60 gggttgtggt tggctccagg cgggcggggg 
tgggtgggga gtctcagagc ttggcgtgag 
gccactgtcc tgagctgcag gtgctggcag 

65 

tgacagcgtt atgtccctct ctctctatcg 



62 



ttgtgtagct 


gtggtgaatg 


gagctctgtg 


4006 


ccctgggacc 


ctaccgggag 


catgctcatg 


4066 


cagagggcag 


cctgggagcc 


tctgagctgg 


4126 


cccagggggc 


gagagagagt 


aggcagggat 


4186 


ctccgtgggc 


ccaggccctc 


cctggcagca 


4246 


acttcctctt 


ctgttccaaa 


ttgeccteat 


4306 


agcagggtga 


ggacatgctt 


ctcagcccag 


4366 


tctggcccct 


ccctcagcec 


tggagggagc 


4426 


gacctggccg 


aggccactgc 


tgtggggccc 


4486 


cagggacccc 


aggggcagcc 


ttcagggtgg 


4546 


caccgctcag 


tgctgatgtg 


gcgcgaggcc 


4606 


cctgggtgtg 


gctccagagg 


ctgcctgtgt 


4566 


tctgggcagt 


cccctgggga 


gcggtggggg 


4726 


gcgtcctggt 


ctctggaggc 


cgtggctggt 


4786 


caggccctca 


gcagcctgtc 


tgttetcctg 


4846 


cccacaaagc 


tcccccctcc 


cccaggagct 


4906 


tgggcttcca 


ggtggctggg 


ectcttagca 


4966 


ggacaggcaa 


ctgccagtcg 


gcagacattg 


502 6 


ggttaggtgt 


ttagctgggg 


gatggtgtgg 


5086 


cctcctcggg 


aagccttctg 


ggtgggggac 


5146 


tttccacaga 


gccttctgca 


acccccgtgg 


5206 


cccctgctcc 


tgcagaggcc 


agccaggcat 


■5266 


gtgttggggg 


atctgtcctt 


tgctgctgct 


5326 


agtaggggtg 


gagacccagg 


cccagcttcc 


5386 


caccacccca 


ccaagtgcgt 


gggcacaccc 


5446 


cggagccctg 


ggagggtggc 


agtgcgccgg 


5506 


tgtcgctgtg 


gccaggggcg 


ctgacccgct 


5566 


tttcttctgc 


cacggtggca 


caggcacctg 


5626 


ctgcgtgccc 


ctgcgcaggc 


acataggccg 


568 6 


gtcccacagg 


gctgggcctg 


caggatggag 


5746 


gagctggggt 


gggcgttctg 


gggccgtggc 


5806 


cag ate tta age aac gca egg tta 


5860 



He Leu Ser Asn Ala Arg Leu 
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10 



20 



30 



35 



40 



60 



63 

95 100 

ttt eta gag aac etc ate aa gtgagtgggc cccggcctgc cccagcccct 5910 
Phe Leu Glu Asn Leu lie Lys 
105 

gccacctcac ccctcgccta cacagaccct cacccacctg cgtctgcag g tat ggc 5966 

Tyr Gly 



ate ctg gtg gac ccc ate cag gtg gtg tet ctg tte ctg aag gae cec 6014 
lis Leu Val Asp Pro lie Gin Val Val Ser Leu Phe Leu Lys Asp Pro 
110 115 120 125 



15 tac age tgg cca get etg tge etg gte att g gtgagctggg tgcccaggag 6065 
Tyr Ser Trp Pro Ala Leu Cys Leu Val lie 
130 • 135 



gcctcaggcc ggcggtgggt gggacagggc tgatctgggc ctgaacctgc cctgggttgc 6125 

ttctgtcctc ag tg gee aat ate ttt gee gtg get gcg tte eag gtg gag 6175 
Val Ala Asn He Phe Ala Val Ala Ala Phe Gin Val Glu 
140 145 

25 aag ege ctg gee gtg gtaagcagtg ccctcacgcc ctcccctgac ttgcctcaag 6230 
Lys Arg Leu Ala Val 
150 



gtccttacca gtcgggctta gggcgggcca ccagctggtc ccactgtgct tcagggtttt 6290 

gggcctttcg tggccttcct gagaggggct gcacctcagg cctggtggct cttcctcagg 6350 

gaggtcctct gaccagggag gggggtccct ggctgacgct ctgctcccac cccag gga 6408 

Gly 

get etg acg gag cag gcg ggg ctg ctg ctg cac ggg gtc aac etg gee 6456 
Ala Leu Thr Glu Gin Ala Gly Leu Leu Leu- His Gly Val Asn Leu Ala 
155 160 165 170 

ace att etc tge tte cca gcg gee gtg gee ttt etc etc gag tct ate 6504 
Thr lie Leu Cys Phe Pro Ala Ala Val Ala Phe Leu Leu Glu Ser He 
175 180 185 

45 act cca g gtgggcccca cccccgcccc cgcccccgcc cacgctgtct cggccacggg 6561 
Thr Pro 



cagcgcgggg ggcgtggcct gagcttgcct ctcccacag tg ggc tec gtg ctg 6614 
50 Vai Gly Ser Val Leu 

X90 

gee ctg atg gtc tac acc ate etc tte etc aag ctg tte tec tac egg 6662 
Ala Leu Met Vai Tyr Thr lie Leu Phe Leu Lys Leu Phe Ser Tyr Arg 
55 195 200 205 

gac gtc aac etc tgg tge cga gag cgc agg get ggg gee aag gee aag 6710 
Asp Val Asn Leu Trp Cys Arg Glu Arg Arg Ala Gly Ala Lys Ala Lys 
210 215 220 225 



get g gtgagggctg cctcgggctg gggccactgg gctgccactt gcctcgggac 6764 
Ala 



65 



cggcaggggc tcggctcacc cccgacccgc cccctgccgc ttgctcgtag et ttg 

Ala Leu 



6819 
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64 



35 



50 



55 



60 



65 



gca ggt aag aag gcc aac ggg gga get gcc cag cgc acc gtg age tac 6867 
Ala Gly Lys Lys Ala Asn Gly Gly Ala Ala Gin Arg Thr Val Ser Tyr 
230 235 240 

GCC gac aac etg acc tac cgc g gtgaggatce tgecgggggc tggggggact 6919 
Pro Asp Asn Leu Thr Tyr Arg 

245 . - 250 ' 



10 gcccggcggc ctggcctgct agccccgccc tcccttccag at etc tac tac ttc 

Asp Leu Tyr Tyr Phe 
255 



6973 



etc ttc gcc ccc acc etg tgc tac gag etc aac ttc ccc cgc tec ccc 7021 
15 Leu Phe Ala Pro Thr Leu Cys Tyr Glu Leu Asn Phe Pro Arg Ser Pro 
260 265 270 

cgc ate cga aag cgc ttc etg etg egg cga etc etg gag atg 7063 
Arg He Arg Lys Arg Phe Leu Leu Arg Arg Leu Leu Glu Met 
20 275 280 285 

gtgaggcggg gectegcggg ecagggtggg cgggcctgcc ggcacccggc accggggctc 7123 

agctcactgt ccgcttgctt cettccccag etg ttc etc acc cag etc cag gtg 7177 
'25 Leu Phe Leu Thr Gin Leu Gin Val 

290 

ggg etg ate cag cag gtacgtgccc gggggggggg gggggggggg gggggggact 7232 
Gly Leu He Gin Gin 
30 295 

ctggggccgt tggggagctg actctgcget ttttgcag tgg atg gtc ccg gee ate 7288 

Trp Met Val Pro Ala He 
300 305 



cag aac tec atg aag ccc ttc aag gtgagcaggc aggcctggca gggtgggttc 7342 
Gin Asn Ser Met Lys Pro Phe Lys 
310 



40 cggggtcagg getgagggag ccagetgtgc cctgtgccea cag gac atg gac tac 7397 

Asp Met Asp Tyr 

315 

tec cgc ate gtg gag cgc etc etg aag etg gcg gtgagtgaec tgctgggtgg 7450 
45 Ser Arg He Val Glu Arg Leu Leu Lys Leu Ala 

320 325 



ggaegcgtgg gggcgggtgg ggetgttetg geacctggea eccactcccc acag gtc 7507 

Val 



ccc aac cae etc ate tgg etc ate ttc ttc tac tgg etc ttc eac tec 7555 
Pro Asn His Leu He Trp Leu He Phe Phe Tyr Trp Leu Phe His Ser 
330 335 340 345 

tgc etg aac gcc gtg get gag etc atg cag ttt gga gac cgc gag ttc 7603 
Cys Leu Asn Ala Val Ala Glu Leu Met Gin Phe Gly Asp Arg Glu Phe 
350 355 360 

tac egg gac tgg tg gtgggtggec ttgceggggc gggggtggtg ggggcccccg 7657 
Tyr Arg Asp Trp Trp 

365 

ctggggctgg ggccggagcc cctgcccaet ctgccccgec cccgcag g aac tec gag 7714 

Asn Ser Glu 
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tec ate acc tac ttc tgg cag aac tgg aac ate cct gtt cac aag tgg 7762 

Ser lie Thr Tyr Phe Trp Gin Asn Trp Asn lie Pro Val His Lys Trp 

370 375 380 385 

5 tgc ate ag gtgggtgtgc gcctgggggc ggggggttgg ggggtgggac 7810 
Cys lie Arg 

ggggtcgcgt ggcccgggcg cccagcccac tgccgcctcc cccgcag a eac tte tac 7867 
10 His Phe Tyr 

390 

aag ccc atg etc egg egg ggc age age aag tgg gca gee agg acg gca 7915 
Lys Pro Met Leu Arg Arg Gly Ser Ser Lys Trp Ala Ala Arg Thr 'Ala 
15 395 400 405 

gtg ttt ctg gee tec gee tte tte cae gag gtcagtgcac tgagggcgcg 7965 
Val Phe Leu Ala Ser Ala Phe Phe His Glu 
410 415 



20 



30 



55 



60 



65 



ccctgcccct ggtgggggtg ggggtggggg tgggggctcg ctgacgcccc tctcccctca 8025 



g tac etg gtg age ate eee ctg cgc atg tte egc etc tgg gee ttc acc 8074 
Tyr Leu Val Ser lie Pro Leu Arg Met Phe Arg Leu Trp Ala Phe Thr 
25 420 425 430 



ggc atg atg gcg cag gtgagcagcc ctggaccccc gctccgcccc gccccgcgag 8129 
Gly Met Met Ala Gin 
435 

cgcagaggct cactcccgtc ctgtgtcccc ag ate ecg ctg gee tgg ata gtg 8182 

He Pro Leu Ala Trp He Val 
440 445 



35 ggc cgc ttc ttc cgc ggc aac tac ggc aac gcg gee gtg tgg ctg tea 8230 
Gly Arg Phe Phe Arg Gly Asn Tyr Gly Asn Ala Ala Val Trp Leu Ser 
450 455 460 

etc ate ate ggg cag ccg gtg gcc gtc ctg atg tac gtc cac gac tac 8278 
40 Leu He He Gly Gin Pro Val Ala Val Leu Met Tyr Val His Asp Tyr 
465 470 475 

tac gtg etc aac cgt gag gcg ccg gca gee ggc acc tgagcgcctc 8324 
Tyr Val Leu Asn Arg Glu Ala Pro Ala Ala Gly Thr 
45 480 485 



50 



caggctggcc 


ccctcgtggg 


tgttggactg 


ctttgccgcg 


ctgcctgcgg 


ctggactaga 


. 8384 


gcctgcccca 


acctgggcgc 


agcaggagga 


ggcctggctg 


gtggaagctg 


cctcctggcc 


8444 


tccaccaggc 


ctctgcctga 


agggcttcct 


cctgccaggg 


gagagcaggc 


ccgacgcagt 


8504 


tctggcccct 


gggaggtgcc 


catgctctgg 


aaaccctaca 


gatctcgccc 


aagggtctga 


8564 


atgtgtcaat 


aaagtgctgt 


gcacagtgag 


ctccctcagc 


ctccagggca 


cagggctggc 


8624 


aggagggggc 


ggccctccca 


cgtggggcca 


tgctgtggga 


aggaggcccc 


agcgcctgga 


8684 


gaggagctgg 


ggctgtggtg 


accctccctg 


cctcacaggg 


ctctgtggtc 


agacgtcttg 


8744 


ccctgcaagg 


tggagactcc 


atgctccaag 


gccccctgtg 


cctgaggtct 


gcacacaagt 


8804 


ggattcaact 


tgggtcaggc 


cagaggctaa 


ggtgtggaag 


aaggttgaga 


atcaggctga 


8864 


cttgaacggc. agcaaagact 


ccaaggcaag 


gctgcagaag 


tctcaaaagc 


tatgcgcaca 


8924 


gtcccctgct ggggtgctca 


cctgggctgg 


gctctgggct 


gcttggacaa 


agcaggtggc 


8984 
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ctggctcagc cctcaccgag ggccttcctt 
tccccgtttt tccaggccct cagcaggtag 
cagagctccg ccactcaaaa agcttggcag 
cctctactgg ctctgccgcc tgggttggaa 
10 tgacagccag ccatggccca gtaagatggg 
agctgtgctg cccccgcccc aaccagaagc 
gacaggctgg ttggctaggc actgtttggt 

15 

cccctgcctt ctaggaagcc accaccctgn 
ggctgagctg ggggtgcgtg ggctgctacg 
20 caccccctgc tgcctgactc aagagtgggg 
cccacggtgc tgacccagtg cacccacctg 
cctgtgggcc cacgctgatc ccgccctggt 

25 

gcccagggga gccaggaggc tccagtgtgc 
cgagacactg cgcatcatgc ccccctccac 
30 gggagaccac acacagagct cgttgtttat 
tccatgtata tatctgcata tctctatgga 
aaatacaatt ctatgaacac ctgcatcctg 

35 

tgagctgtgg ctcacagggc taggccctcg 
ctccccccct ccgccagcca ggggaccagg 
40 ctctcctcct ggggcttcta cgagacagtg 
gtcagcaggg agatggtggg gtcatctctt 
cctcccccag ctcgaagagc accggcaggt 

45 

ccaggagcag caggggctgg gcggtgtagt 
ggtcagggtg ccggtggggc cggggggcgg 
50 ccccggcccc agccaccccg cctacgcacg 
ctcatgtcgg gcaccgtaac cgaggggctg 
ctgccctgga agcggggcgg gctgggcgcg 

55 

gggtgctggt gtccacgctg aagccatggc 
tggagtccat ggcgtccaag tggtcgctga 
60 gtgacgtggc atccaggcac ccccaggtgc 
gcctgccctg cgcaggctca cgggaggccc 
gccaccggtg gtggctcagg ctccctcatc 

65 

gctgggccgg ctaagacatc agggggctga 
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66 



gggggcagag 


gttggcctga 


tgccaggggc 


9044 


ttgggtgtgg 


ccctcatgat 


accttggtcc 


9104 


tgaggcaagg 


gcaaccccgg 


gctgttcccc 


9164 


accctgaggc 


tgtgccaggc 


aggtgtaccc 


9224 


tgcccgaggt 


ggtacctggg 


cagcggaccc 


9284 


cgctctagcc 


catggtggtc 


gtctgggcga 


9344 


ctacagcagg 


tgtaggcagc 


gtctccctga 


9404 


gccctactca 


tcagcaagga 


cagcgagcag 


9464 


gnccccgcac 


cttcatcaca 


tgcacctctg 


9524 


gggggggtcc 


tgtgcttcct 


tcattccaga 


9584 


gtccgctagt 


gctgacctgg 


ccacagggct 


9644 


cccttcataa 


agaactcttg 


agcacatgca 


9704 


tgtgtccatc 


tgcctccctt 


cagccccttc 


97 64 


ccccacccac 


actggcagga 


ggaacacaca 


9824 


aaatctctgc 


ctggctcatc 


ggtctgtttg 


9884 


aggggaaagg 


gggactcgtg 


taaaaatcca 


9944 


gtcagtctga 


gtgtggccgt 


gaagcccagg 


10004 


gtgctggccg 


ggggccactc 


cccaccccct 


10064 


ctcctggaca 


ccaggcctgc 


ccaaggcctg 


10124 


gggtccttgg 


ctttgggggg 


ttctgagccc 


10184 


atatttcgtc 


tccctcggag 


aagtaggagc 


10244 


cgctgctccc 


cacgtccacg 


gagcccgggt 


10304 


gcaccagctg 


cttccctagg 


ggtgcgactg 


10364 


ggtggggnnn 


nnnnnnnnnn 


. nnnccccccc 


10424 


ctggccaggc 


tgctgtccag 


gtcgggcagg 


10484 


aacagctgca 


gggaagaggg 


gcgggtcaga 


10544 


gggcgggcag 


ggttagactc 


acatccagca 


10604 


ttgtcagcat 


ggtctgcagg 


ttgtccaggt 


. 10664 


gctcggtcct 


ggccagagga 


aggggagcag 


10724 


agtcctgccg 


gcccttaagc 


ccagctgcca 


.10784 


tgaggtgggg 


ccctgggcct 


gggacacttg 


10844 


accgtgaggc 


cccgtaaggc 


agcacgtcag 


10904 


gttcagggct 


cagaagggcc 


cggggcccca 


10964 



10 



20 
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gcttctcagg ttcagcaacg cctgcccttc acactggagg acagcaggag ggtggagctg 11024 

tgctcggcca ggctgggcca gcctacccca gcatgcgcca ggatagatta ggcctgcccc 11084 

ctcgaggagg ccaggcagta cctcggggtg ctgcctcagg ccccagaggt gggtggggag 11144 

cctgttcaca ggctgggcag gggtctgctc agaggctggg gaggggccca agctctggcg 11204 

gagccctcct ccctccccag tgggacagcg ctaaccctgg ctggactcgg ccatgcagag 11264 

aggaggaggg gcaggggaag aggcgggcga ccccagacat ctttggagtg cgagccaaac 11324 

tgcaagatag aaagacaaga gcccccctct gcgggtgggg cccagcgagt ccaagccccc 11384 

15 caccccgttc acccacctct ggggtggtag gacagacaga ccaggcggcc cggggagggc 11444 

ggctcagggg gcaggggtgc ccgccacgtg cctggttccc acggtgcagg ggctgcggga 11504 

ccccctggct ggcatgtctc cctccccagc tccatgctct gggccaggtc ggccagtttc 11564 

ccccagcaaa tcccatgccg agggcctgag ggccatgcgt gtccagcccg gcactgtcct 11624 

tgtcggaggc ccctgcgctt ccagagaccg aagggcgcct ggaaggcact cacttgtcta 11684 

25 ggcaggcgac gctgaggcac ttctcgggag ccgaggcggg gagggtcgac ggatagcggc 11744 

cccgagtgat ccgatagaag cttcgta 11771 

30 <210> 2 

<211> 489 

<212> PRT 

35 

<213> Bos taurus 

40 <220> 

<221> misc_f eature 

<222> (8315) , . (8317) 

45 

<223> translation stop codon 
<220> 

50 <221> misc_feature 

<222> (9434) . . (9434) 

<223> ambiguous nucleotide 

55 

<220> 

<221> misc_feature 

60 <222> (9496) (9496) 

<223> ambiguous nucleotide 
<220> 

<221> misc feature 



65 
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<222> (10402). . (10417) 
<223> ambiguous nucleotides 
5 <400> 2 

Met Gly Asp Arg Gly Gly Ala Gly Gly Ser Arg Arg Arg Arg Thr Gly 
1 5 . 10 15 

10 

Ser Arg Pro Ser lie Gin Gly Gly Ser Gly Pro Ala Ala Ala Glu Glu 
20 25 '30 



15 Glu Val Arg Asp Val Gly Ala Gly Gly Asp Ala Pro Val Arg Asp Thr 
35 40 45 



Asp Lys Asp Gly Asp Val Asp Val Gly Ser Gly His Trp Asp Leu Arg 
20 50 55 60 



Cys His Arg Leu Gin Asp Ser Leu Phe Ser Ser Asp Ser Gly Phe Ser 
65 70 75 80 

25 

Asn Tyr Arg Gly lie Leu Asn Trp Cys Val Val Met Leu He Leu Ser 
85 90 95 

30 

Asn Ala Arg Leu Phe Leu Glu Asn Leu He Lys Tyr Gly He Leu Val 
100 105 110 

35 Asp Pro He Gin Val Val Ser Leu Phe Leu Lys Asp Pro Tyr Ser Trp 
115 120 125 



Pro Ala Leu Cys Leu Val He Val Ala Asn He Phe Ala Val Ala Ala 
40 130 . 135 140 



Phe Gin Val Glu Lys Arg Leu Ala Val Gly Ala Leu Thr Glu Gin Ala 
145 150 155 160 

45 

Gly Leu Leu Leu His Gly Val Asn Leu Ala Thr He Leu Cys Phe Pro 
165 170 175 

50 

Ala Ala Val Ala Phe Leu Leu Glu Ser He Thr Pro Val Gly Ser Val 
180 185 190 

55 "Leu Ala Leu Met Val Tyr Thr He Leu Phe Leu Lys Leu Phe Ser Tyr 
195 200 205 



Arg Asp Val Asn Leu Trp Cys Arg Glu Arg Arg Ala Gly Ala Lys Ala 
60 210 215 220 



Lys Ala Ala Leu Ala Gly Lys Lys Ala Asn Gly Gly Ala Ala Gin Arg 
225 230 235 240 

65 



Thr Val Ser Tyr Pro Asp Asn Leu Thr Tyr Arg Asp Leu Tyr Tyr Phe 
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245 250 255 



Leu Phe Ala Pro Thr Leu Cys Tyr Glu Leu Asn Phe Pro Arg Ser Pro 
5 260 265 270 



Arg lie Arg Lys Arg Phe Leu Leu Arg Arg Leu Leu Glu Met Leu Phe 
275 280 285 

10 

Leu Thr Gin Leu Gin Val Gly Leu He Gin Gin Trp Met Val Pro Ala 
290 295 300 

15 

He Gin Asn Ser Met Lys Pro Phe Lys Asp Met Asp Tyr Ser Arg He 
305 310 315 320 



20 Val Glu Arg Leu Leu Lys Leu Ala Val Pro Asn Kis Leu He Trp Leu 
325 330 335 



He Phe Phe Tyr Trp Leu Phe His Ser Cys Leu Asn Ala Val Ala Glu 
25 340 345 350 



Leu Met Gin Phe Gly Asp Arg Glu Phe Tyr Arg Asp Trp Trp Asn Ser 
355 360 365 

30 

Glu Ser He Thr Tyr Phe Trp Gin Asn Trp Asn He Pro Val His Lys 
370 375 380 

35 

Trp Cys He Arg His Phe Tyr Lys Pro Met Leu Arg Arg Gly Ser Ser 
385 390 395 400 



40 Lys Trp Ala Ala Arg Thr Ala Val Phe Leu Ala Ser Ala Phe Phe His 
405 410 415 



Glu Tyr Leu Val Ser He Pro Leu Arg Met Phe Arg Leu Trp Ala Phe 
45 420 425 430 



Thr Gly Met Met Ala Gin He Pro Leu Ala Trp He Val Gly Arg Phe 
435 440 445 

50 

Phe Arg Gly Asn Tyr Gly Asn Ala Ala Val Trp Leu Ser Leu He He 
450 455 460 

55 

Gly Gin Pro Val Ala Val Leu Met Tyr Val His Asp Tyr Tyr Val Leu 
465 470 475 480 



60 Asn Arg Glu Ala Pro Ala Ala Gly Thr 
485 



<210> 3 

65 

<211> 31 
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10 



25 



35 



45 



55 



65 



70 

<212> DNA . 
<213> Bos taurus 

<220> 

<221> misc_signal 
<222> (29).. (31) 



<223> bases! to 3 of the Kozak recognition sequence. See the genomic a 
equence from the start codon for bases 4 to 7 of the Kozak recogn 
15 ition sequence or the D(3AT1 cDNA for the complete recognition seq 

uence . 



20 <400> 3, 

acttggccgc ggcggggtgc gaactaaggc c 31 



<210> 4 

<211> 1732 

<212> DNA 

30 <213> Bos taurus 



<220> 

<221> misc__feature 

<222> (1)..(3} 

40 <223> translation start codon 



<220> 

<221> misc_feature 

<222> (1468) . . (1470) 

50 <223> translation stop codon 



<220> 

<221> variation 

<222> (689) . . (755) 

60 <223> this sequence is deleted in the alternately spliced transcript 



<220> 

<221> variation 



10 



25 



60 



65 
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<222> (358).. (358) 

<223> cytosine (C) to thyamine (T) substitution polymorphism 
<220> 

<221> variation 
<222> (694) , , (695) 



<223> adenine (A) -adenine (A) to guanine (G) -cytosine (C) substitution p 
olymorphism 
15 AA corresponds to the Q allele 

GC corresponds to the q allele 



20 <220> 

<221> primer_bind 
<222> (616) . . (635) 



<223> Primer DgatforAD 

TTCTCCTACCGG6ACGTCAA 



30 

<220> 

<221> primer_bind • 

35 <222> (743).. (767) 

<223> Primer DgatrevAD 

AAGTAGTAGAGATCGCGGTAGGTCA 
reverse primer 

40 

<220> 

45 <221> primer_bind 

<222> (690) . . (703) 

<223> Primer FofAA (FAM) 
50 CGTTGGCCTTCTTA 
reverse primer' 

55 <220> 

<221> prime r_bind 
<222> (688) . . (709) 



<223> Primer DgatADGC (VIC) 
TTGGCCGCCTTACC 
reverse primer 
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72 

<220> 

<221> pri2ner_bind 

<222> (617) . . (636) 

<223> Primer DgatforRT66 

TCTCCTACCGGGACGTCAAC 

<220> 

<221> primer_bind 
<222> (739) . . (760) 



<223> Primer DgatrevRT66 

GAGATCGCGGTAGGTCAGGTT 
20 reverse primer 



<220> 

25 

<221> primer^bind 

<222> (676) . . (767) 

30 <223> Primer Dgatf orRTless66 

GCT GCTTTGGCAGATCTCTACTACTT 

This primer selectively binds and amplifies a c'ontigious sequence 
characteristic of the splice variant & generated by the deletion 
bases 689 to 755 of this sequence. 

35 

<220>- 

40 <221> prime r_bind 
<222> (817).. (833) 



<223> Primer DgatrevRTless66 
45 AAGCGCTTTCGGATGCG 



<220> 

50 

<221> primer_bind 

<222> (722) . . (734) 

55 <223> Primer Dgatwith66 (FAM) 
CCGTGAGCTACCC 



60 <220> 

<221> primer_bind 
<222> (771) . , (785) 

65 

<223> Primer Dgatless66 (VIC) 
CTTCGCCCCCACCCT 
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<400> 4 

5 atgggcgacc gcggcggcgc gggcggctcc 
atccagggcg gcagtgggcc cgcggcagcg 
ggggacgcgc cggtccggga cacagacaag 

10 

tgggacctga ggtgtcaccg cctgcaggat 
aactaccgtg gcatcctgaa ttggtgtgtg 
15 tttctagaga acctcatcaa gtatggcatc 
ttcctgaagg acccctacag ctggccagct 
gccgtggctg cgttccaggt ggagaagcgc 

20 

gggctgctgc tgcacggggt caacctggcc 
tttctcctcg agtctatcac tccagtgggc 
25 ctcttcctca agctgttctc ctaccgggac 
ggggccaagg ccaaggctgc tttggcaggt 
accgtgagct accccgacaa cctgacctac 

30 

accctgtgct acgagctcaa cttcccccgc 
cggcgactcc tggagatgct gttcctcacc 
35 atggtcccgg ccatccagaa ctccatgaag 
gtggagcgcc tcctgaagct ggcggtcccc 
tggctcttcc actcctgcct gaacgccgtg 

40 

ttctaccggg actggtggaa ctccgagtcc 
cctgttcaca agtggtgcat cagacacttc 
45 aagtgggcag ccaggacggc agtgtttctg 
agcatccccc tgcgcatgtt ccgcctctgg 
ctggcctgga tagtgggccg cttcttccgc 

50 

tcactcatca tcgggcagcc' ggtggccgtc 
aaccgtgagg cgccggcagc cggcacctga 
55 tggactgctt tgccgcgctg cctgcggctg 
aggaggaggc ctggctggtg gaagctgcct 
gcttcctcct gccaggggag agcaggcccg 

60 

gctctggaaa ccctacagat ctcgcccaag 



<210> 5 

65 

<211> 489 



73 



cggcgccgga 


ggacggggtc 


gcggccttcg 


60 


gaagaggagg 


tgcgggatgt 


gggcgccgga 


120 


gacggagacg 


tagacgtggg 


cagcggccac 


180 


tccctgttca 


gttctgacag 


tggcttcagc 


240 


gtgatgctga 


tcttaagcaa 


cgcacggtta 


300 


ctggtggacc 


ccatccaggt 


ggtgtctctg 


360 


ctgtgcctgg 


tcattgtggc 


caatatcttt 


420 


ctggccgtgg 


gagctctgac 


ggagcaggcg 


480 


accattctct 


gcttcccagc 


ggccgtggcc 


540 


tccgtgctgg 


ccctgatggt 


ctacaccatc 


600 


gtcaacctct 


ggtgccgaga 


gcgcagggct 


660 


aagaaggcca 


acgggggagc 


tgcccagcgc 


720 


cgcgatctct 


actacttcct 


cttcgccccc 


780 


tccccccgca 


tccgaaagcg 


cttcctgctg 


840 


cagctccagg 


tggggctgat 


ccagcagtgg 


900 


cccttcaagg 


acatggacta 


ctcccgcatc 


960 


aaccacctca 


tctggctcat 


cttcttctac 


1020 


gctgagctca 


tgcagtttgg 


agaccgcgag 


1080 


atcacctact 


tctggcagaa 


ctggaacatc 


1140 


tacaagccca 


tgctccggcg 


gggcagcagc 


1200 


gcctccgcct 


tcttccacga 


gtacctggtg 


1260 








1320 


ggcaactacg 


gcaacgcggc 


cgtgtggctg 


1380 


ctgatgtacg 


tccacgacta 


ctacgtgctc 


1440 


gcgcctc'cag 


gctggccccc 


tcgtgggtgt 


1500 


gactagagcc 


tgccccaacc 


tgggcgcagc 


1560 


cctggcctcc 


accaggcctc 


tgcctgaagg 


1620 


acgcagttct 


ggcccctggg 


aggtgcccat 


1680 


ggtctgaatg 


tgtcaataaa 


ga 


1732 
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<212> PRT 

<213> Bos taurus 

5 

<220> 

<221> VARIANT 

10. 

<222> (232).. (232) 

<223> an amino acid substitution (K -> A) caused by a polymorphism at b 
ases 7224-7225 of the genomic sequence (measured from the adenine 
15 residue of the translation start codon) . Lysine (K) corresponds 

to the Q allele, alanine (A) corresponds to -the q allele. 



20 <400> 5. 

Met Gly Asp Arg Gly Gly Ala Gly Gly Ser Arg Arg Arg Arg Thr Gly 
1 5 . 10 15 

25 

Ser Arg Pro Ser lie Gin Gly Gly Ser Gly Pro Ala Ala Ala Glu Glu 
20 25 30 



30 Glu Val Arg Asp Val Gly Ala Gly Gly Asp Ala Pro Val Arg Asp Thr 
35 40 45 



Asp Lys Asp Gly Asp Val Asp Val Gly Ser Gly His Trp Asp Leu Arg 
35 50 55 60 



Cys His Arg Leu Gin Asp Ser Leu Phe Ser Ser Asp Ser Gly Phe Ser 
65 70 75 80 

40 

Asn Tyr Arg Gly lie Leu Asn Trp Cys Val Val Met Leu He Leu Ser 
85 90 95 

45 

Asn Ala Arg Leu Phe Leu Glu Asn Leu He Lys Tyr Gly He Leu Val 
100 105 110 



50 Asp Pro lie Gin Val Val Ser Leu Phe Leu Lys Asp Pro Tyr Ser Trp 
115 120 125 



Pro Ala Leu Cys Leu Val He Val Ala Asn He Phe Ala Val Ala Ala 
55 130 135 140 



Phe Gin Val Glu Lys Arg Leu Ala Val Gly Ala Leu Thr Glu Gin Ala 
145 150 155 160 

60 

Gly Leu Leu Leu His Gly Val Asn Leu Ala Thr He Leu Cys Phe Pro 
165 170 175 

65 

Ala Ala Val Ala Phe Leu Leu Glu Ser He Thr Pro Val Gly Ser Val 
180 185 190 



r 
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Leu Ala Leu Met Val Tyr Thr He Leu Phe Leu Lys Leu Phe Ser Tyr 
195 200 205 



Arg Asp Val Asn Leu Trp Cys Arg Glu Arg Arg Ala Gly Ala Lys Ala 
210 215 220 



Lys Ala Ala Leu Ala Gly Lys Lys Ala Asn Gly Gly Ala Ala Gin Arg 
225 230 235 240 



Thr Val Ser Tyr Pro Asp Asn Leu Thr Tyr Arg Asp Leu Tyr Tyr Phe 
245 250 255 



Leu Phe Ala Pro Thr Leu Cys Tyr Glu Leu Asn Phe Pro Arg Ser Pro 
260 265 270 



Arg He Arg Lys Arg Phe Leu Leu Arg Arg Leu Leu Glu Met Leu Phe 
275 280 285 



Leu Thr Gin Leu Gin Val Gly Leu He Gin Gin Trp Met Val Pro Ala 
290 295 300 



He Gin Asn Ser Met Lys Pro Phe Lys Asp Met Asp Tyr Ser Arg He 
305 310 315 320 



Val Glu Arg Leu Leu Lys Leu Ala Val Pro Asn His Leu He Trp Leu 
325 330 335 



He Phe Phe Tyr Trp Leu Phe His Ser Cys Leu Asn Ala Val Ala Glu 
340 345 350 



Leu Met Gin Phe Gly Asp Arg Glu Phe Tyr Arg Asp Trp Trp Asn Ser 
355 360 365 



Glu Ser He Thr Tyr Phe Trp Gin Asn Trp Asn He Pro Val His Lys 
370 375 380 



Trp Cys He Arg His Phe Tyr Lys Pro Met Leu Arg Arg Gly Ser Ser 
385 390 395 400 



Lys Trp Ala Ala Arg Thr Ala Val Phe Leu Ala Ser Ala Phe Phe His 
405 410 415 



Glu Tyr Leu Val Ser He Pro Leu Arg Met Phe Arg Leu Trp Ala Phe 
420 425 430 



Thr Gly Met Met Ala Gin He Pro Leu Ala Trp He Val Gly Arg Phe 
435 440 445 



Phe Arg Gly Asn Tyr Gly Asn Ala Ala Val Trp Leu Ser Leu He He 
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450 455 460 



Gly Gin Pro Val Ala Val Leu Met Tyr Val His Asp Tyr Tyr Val Leu 
5 465 470 475 480 



Asn Arg Glu Ala Pro Ala Ala Gly Thr 
485 

10 

<210> 6 ^ 
<211> 467 

15 

<212> PRT 

<213> Bos taurus 

20 

<400> 6 

Met Gly Asp Arg Gly Gly Ala Gly Gly Ser Arg Arg Arg Arg Thr Gly 
25 1 5 10. 15 



Ser Arg Pro Ser He Gin Gly Gly Ser Gly Pro Ala Ala Ala Glu Glu 
20 25 30 

30 

Glu Val Arg Asp Val Gly Ala Gly Gly Asp Ala Pro Val Arg Asp Thr 
35 40 45 

35 

Asp Lys Asp Gly Asp Val Asp Val Gly Ser Gly His Trp Asp Leu Arg 
50 55 60- 



40 Cys His Arg Leu Gin Asp Ser Leu Phe Ser Ser Asp Ser Gly Phe Ser 
65 70 75 80 



Asn Tyr Arg Gly He Leu Asn Trp Cys Val Val Met Leu He Leu Ser 
45 85 90 95 



Asn Ala Arg Leu Phe Leu Glu Asn Leu He Lys Tyr Gly He Leu Val 
100 105 110 

50 

Asp Pro He Gin Val Val Ser Leu Phe Leu Lys Asp Pro Tyr Ser Trp 
115 120 125 

55 

Pro Ala Leu Cys Leu Val He Val Ala Asn He Phe Ala Val Ala Ala 
130 135 140 



60 Phe Gin Val Glu Lys Arg Leu Ala Val Gly Ala Leu Thr Glu Gin Ala 
145 150 155 160 



Gly Leu Leu Leu His Gly Val Asn Leu Ala Thr He Leu Cys Phe Pro 
65 165 170 175 
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Ala Ala Val Ala Phe Leu Leu Glu Ser lie Thr Pro Val Gly Ser Val 
180 185 190 

5 Leu Ala Leu Met Val Tyr Thr lie Leu Phe Leu Lys Leu Phe Ser Tyr 
195 200 205 

Arg Asp Val Asn Leu Trp Cys Arg Glu Arg Arg Ala Gly Ala Lys Ala 
10 210 215 220 

Lys Ala Ala Leu. Ala Asp Leu Tyr Tyr Phe Leu Phe Ala Pro Thr Leu 
225 230 235 240 

15 

Cys Tyr Glu Leu Asn Phe Pro Arg Ser Pro Arg lie Arg Lys Arg Phe 
245 250 255 

20 

Leu Leu Arg Arg Leu Leu Glu Met Leu Phe Leu Thr Gin Leu Gin Val 
260 265 270 

25 Gly Leu He Gin Gin Trp Met Val Pro Ala He Gin Asn Ser Met Lys 
275 280 285 

Pro Phe Lys Asp Met Asp Tyr Ser Arg He Val Glu Arg Leu Leu Lys 
30 290 295 300 

Leu Ala Val Pro Asn His Leu He Trp Leu He Phe Phe Tyr Trp Leu 
305 310 315 320 

35 

Phe His Ser Cys Leu Asn Ala Val Ala Glu Leu Met Gin Phe Gly Asp 
325 330 335 

40 

Arg Glu Phe Tyr Arg Asp Trp Trp Asn Ser Glu Ser He Thr Tyr Phe 
340 345 350 

45 Trp Gin Asn Trp Asn He Pro Val His Lys Trp Cys He Arg His Phe 
355 360 365 

Tyr Lys Pro Met Leu Arg Arg Gly Ser Ser Lys Trp Ala Ala Arg Thr 
50 370 375 380 

Ala Val Phe Leu Ala Ser Ala Phe Phe His Glu Tyr Leu Val Ser He 
385 390 395 400 

55 

Pro Leu Arg Met Phe Arg Leu Trp Ala Phe Thr Gly Met Met Ala Gin 
405 410 415 

60 

He Pro Leu Ala Trp He Val Gly Arg Phe Phe Arg Gly Asn Tyr Gly 
420 425 430 



65 Asn Ala Ala Val Trp Leu Ser Leu He He Gly Gin Pro Val Ala Val 
435 440 445 
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5 



15 



25 



35 



45 



78 



Leu Met Tyr Val His Asp Tyr Tyr Val Leu Asn Arg Glu Ala Pro Ala 
450 .455 460 



Ala Gly Thr 
465 



10 <210> 7, 

<211> 12 

<212> DNA 

<213> Bos taurus 



20 <220> 

<221> variation 

<222> (7) -.{7) 

<223> thymidine (T) to guanine (G) substitution polymorphism 



30 <400> 7 

cagtgctagg gg 12 



<210> 8 

<211> 10 

<212> DNA 

40 <213> Bos taurus 



<220> 

<221> . variation 

<222> {6).. (6) 

50 <223> thymidine (T) to guanine (G) substitution polymorphism 



<400> 8 

55 ' gcattgcgct 10 



<210> 9 

60 <2ii> 11 

<212> DNA 

<213> Bos taurus 

65 



10 



20 



30 



40 



50 



60 
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<220> 

<221> variation 
<222> (6).. (6) 

<223> thymidine (T) to cytosine (G) substitution polymorphism 



<400> 9 

taccctggga c il 



15 <210> 10 

<211> 11 

<212> DNA 

<213> Bos taurus 



25 <220> 

<221> variation 

<222> (6).. (6) 

<223> adenine (A) to guanine (G) substitution polymorphism 



35 <220> 

<221> variation 

<222> (6).. (6) 

<223> adenine (A) to guanine (G) substitution polymorphism 



45 <400> 10 

ctcttagcag c li 



<210> 11 

<211> 10 

<212> DNA 

55 <213> Bos taurus 



<220> 

<221> variation 

<222> (5).. (5) 

65 <223> guanine (G) to adenine (A) substitution polymorphism 
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<400> 11 

acaggcaact 10 

<210> 12 

<211> 11 

<212> DNA 

<213> Bos taurus 



<220> 

<221> variation 

<222> (6).. (6) 

<223> cytosine (C) to thyamine (T) substitution polymorphism 



<400> 12 

tgtctctgtt c 11 



<210> 13 

<211> 14 

35 <212> DNA 

<213> Bos taurus 

40 

<220> 

<221> variation 

45 <222> (7) . . (8) 

<223> adenine (A) -adenine (A) to guanine (G) -cytosine (C) substitution p 
olymorphism 

AA corresponds to the Q allele 
50 GC corresponds, to the q allele 



<400> 13 

55 ggtaagaagg ccaa 14 

<210> 14 

60 <2il> 11 

<212> DNA 

<213> Bos taurus 

65 
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<220> 

<221> variation 
5 <222> (6).. (6) 

<223> guanine (G) to adenine (A) substitution polymorphism 



10 



20 



30 



35 



40 



60 



<400> 14 

gcggtgagga t 11 



15 <210> 15 

<211> 16 

<212> DNA 

<213> Bos taurus 



25 <220> 

<221> variation 
<222> (6).. (7) 



<223> guanine (G) -guanine (G) to adenine {A)-cytosine (C) substitution 
polymorphi sm 
GG-AC 



<400> 15 

gggggggggg gactct 16 



<210> 16 

<211> 11 

45 <212> DNA 

<213> Bos taurus 

50 

<220> 

<221> variation 

55 <222> (6) . . (6) 

<223> adenine (A) to guanine (G) substitution polymorphism 



<400> 16 

gagtgacctg c 11 



65 <210> 17 
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<211> 11 
<212> DNA 
5 <213> Bos taurus 



10 



<220> 

<221> variation 

<222> (6).. (6) 

15 <223> cytosine (C) to thymidine (T) substitution polymorphism 

<400> 17 

20 ggacgcgtgg g 11 

<210> 18 

25 <2il> 11 

<212> DNA 

<213> Bos taurus 

30 

<220> 

35 <221> variation 

<222> (6) .. (6) 

<223> guanine (G) to adenine (A) substitution polymorphism 

40 



45 



50 



55 



60 



65 



<400> 18 

ggtgggggtg g 11 

<210> 19 

<211> 11 

<212> DNA 

<213> Bos taurus 



<220> 

<221> variation 

<222> (6).. (6) 

<223> ctosine (C) to thymidine (T) siibstitution polymorphism 



<400> 



19 
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II 



<210> 20 

5 

<2ai> 14 

<212> DNA 

10 <213> Bos taurus 



<400> 20 
15 cgttggcctt ctta 



14 



<210> 21 

20 <211> 14 

<212> DNA 

<213> Bos taurus 

25 



30 



<400> 21 
ttggccgcct tacc 



14 



35 
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CLAIMS: 

1. A method of determining genetic merit of a bovine with respect to* milk . 
composition and volume which comprises the step of determining the DGATl genotypic 

5 state of said bovine. 

2. A method as claimed in claim 1, wherein the genotjrpic state is determined with 
respect to DNA, mRNA and/ or protein obtained from said bovine by direct or indirect 
methods. 

10 

3. A method as claimed in claim 1, wherein the genotypic state is determined by 
the presence of at least one nucleotide difference from the nucleotide sequence encoding 
bovine DGATl (SEQ ID NO: 1) either by direct or indirect methods. 

15 4. A method as claimed in claim 3, wherein the genotypic state is determined by 
the presence of one or more polymorphisms of SEQ ID NOs: 7 to 19 either by direct or 
indirect methods. 

5. A method as claimed in claim 4, wherein the genotypic state is determined by 
20 detecting the presence of the K232A polymorphism (SEQ ID NO: 13), either by direct or 

indirect mehtods. 

6. A method of selecting a bovine having a desired DGATl genotypic state 
comprising determining the genotypic state according to any one of claims 1 to 5 and 

25 selecting said bovine on the basis of said determination. 

7. A bovine selected by the method of claim 6. 

8. A method of identifying a bovine which possesses a genotype indicative of 
30 altered milk production traits, said method comprising: 

obtaining a nucleic acid sample from said bovine and identifying a 
polymorphism selected from the group comprising SEQ ID NOs: 7 to 19 of the bovine 
DGATl gene, 

wherein the presence of said poljmiorphism is associated with altered milk 
35 production traits. 
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9. A method as claimed in claim 8, wherein said altered milk production traits 
comprise an increase in milk volume and/ or an increased protein to fat ratio of the milk 
composition or a decrease in milk volume and/or an increase in fat content of the milk 
composition. 

10. A method as claimed in claim 8, wherein the polymorphism is K232A (SEQ ID 
NO: 13), wherein the presence or absence of an alanine is associated with the altered 
milk production traits as claimed in claim 9. 

11. A method as claimed in claim 8, further comprising the step of amplifying said 
bovine DGATl gene sequence. 

12. A method as claimed in claim 11, wherein primers selected from the group 
consisting of (SEQ ID NOs: 20 and 21) are used in said amplification. 

13. A primer suitable for use in detecting a polymorphism selected from the group 
comprising SEQ ID NOs: 7 to 19 of the bovine DGATl gene, said primer consisting of a 
nucleotide sequence having about at least 12 contiguous bases of SEQ ID NOs. 1, 3 or 
4. 

14. A bovine identified by the method of any one of claims 8 to 12. 

15. A bovine as claimed in claim 7 or 14, comprising a bull. 

16. Semen produced by a bovine as claimed in claim 15. 

17. A bovine as claimed in claim 7 or 14, comprising a cow. 

18. Milk produced by a bovine as claimed in claim 17. 

19. Milk as claimed in claim 18, comprising one or more altered characteristics 
selected from the group consisting of increased % protein; increased total protein 
content; increased protein to fat ratio; decreased total fat content; increased fat 
softness; increase unsaturated fat content; increased or decreased solid fat content; and 
increased or decreased total volume; when compared to milk produced by a non- 
genotyped bovine. 
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20. Milk as claimed in claim 18, which has a decreased solid fat content, and 
increased fat softness and is advantageous for use in dairy product processing plants. 

5 21. A dairy product made from the milk as claimed in any one of claims 18 to 20. 

22. A dairy product made from the milk as claimed in claim 20, wherein said 
product provides a beneficial health effect. 

10 23, The use of the DGATl gene sequence (SEQ ID NOs: 1, 3 and 4) or a fragment or 
variant thereof, in the identification of one or more molecular DNA markers useful in 
genotyping and /or selecting a bovine according to the methods of any one of claims 1 to 
6 and 8 to 12. 

15 24. The use of one or more polymorphism sequences selected from the group 
consisting of SEQ ID NOs: 7 to 19 in a method of identification and selection of a bovine 
having at least one of said polymorphisms in its DGATl gene. 

25. The use of a probe in the methods of genotyping according to any one of claims 
20 1 to 6 and 8 to 12, wherein the probe is selected from any 5 or more contiguous 
nucleotides of the DGATl sequence of SEQ ID NOs: 1, 3 or 4, which is sufficiently 
complementary with said nucleic acid sequence so as to bind thereto under stringent 
conditions. 

25 26. A kit for genotyping a bovine with respect to milk composition and volume 
associated with DGATl, comprising a primer or probe as claimed in claim 13 or 25. 

27. An isolated nucleic acid molecule having the sequence of SEQ ID NOs: 1 or 4 
with or without one or more polymorphisms selected from the group consisting of SEQ 

30 ID Nos: 7 to 19, or a fragment or variant thereof. 

28, An isolated nucleic acid molecule comprising a DNA molecule having in whole 
or in part the nucleotide sequence of SEQ ID NOs: 1, 3 or 4 or which varies from the 
sequence due to the degeneracy of the genetic code, or a nucleic acid strand capable of 

35 hybridising with said nucleic acid molecule under stringent hybridisation conditions. 
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29. An isolated mRNA transcribed from DNA having a sequence wiiich corresponds 
to a nucleic acid molecule as claimed in claim 27 or 28. 

30. A recombinant cloning vector comprising the DNA molecule of claim 27 or 28. 

5 

31. A prokaryotic or eukaryotic cell containing the cloning vector of claim 30. 

32. A transfected cell line which expresses a protein encoded by the nucleic acid 
molecule of claim 27 or 28. 

10 

33. A diagnostic kit useful in detecting a DNA molecule as claimed in claim 27 or 
28, comprising a first and second primer for amplifying the DNA, the primers being 
complementary to nucleotide sequences of the DNA upstream and downstream, 
respectively, of a polymorphism in the portion of the DNA encoding DGATl which 

15 results in altered relative milk lipid and protein production and milk voliome, wherein at 
least one of the nucleotide sequences is selected to be from a non-coding region of the 
DGATl gene. 

34. A kit as claimed in claim 33, further comprising a third primer complementary 
20 to a polymorphism selected from the group consisting of of SEQ ID NO: 7 to 19 located 

on the DGATl gene. 



35. A process for producing a protein encoded by a DNA molecule of claim 27 or 28, 
comprising the steps: 

25 a) preparing a DNA fragment including a nucleotide sequence which encodes the 
protein; 

b) incorporating the DNA fragment into an expression vector to obtain a 
recombinant DNA molecule which includes the DNA fragment and is capable of 
undergoing replication; 

30 c) transforming a host cell with the recombinant DNA molecule to produce a 

transformant which can express the protein; 

d) culturing the transformant to produce the protein; and 

e) recovering the protein from resulting cultured mixture. 



35 



36. A purified protein having an amino acid sequence of SEQ ID NOs: 2, 5 or 6 or a 
fragment or variant thereof having biological activity of DGATl . 
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37. An antibody raised against the protein as claimed in claim 36. 

38. A method for the modulation of milk production and content in a bovine by 
modulating the activity of the DGATl protein. 

5 

39. A method as claimed in claim 38, comprising administering to the bovine an 
effective amount of a nucleic acid molecule substantially complementary to at least a 
portion of mRNA encoding the bovine DGATl protein and being of sufficient length to 
sufficiently reduce expression of said DGATl . 

10 

40. An isolated antisense nucleic acid molecule for use in a method as claimed in 
claim 39. 

41. A method of modulating milk production and/or milk content in a lactating 
15 bovine, including administering to the bovine an effective amount of a nucleic acid 

molecule having ribozyme activity and a nucleotide sequence substantially 
complementary to at least a portion of mRNA encoding a bovine DGATl and being of 
sufficient length to bind selectively thereto to sufiiciently reduce expression of said 
DGATl. 

20 

42. A method of inhibiting the activity of bovine DGATl in a lactating bovine so as 
to modulate milk production and/ or milk solids content, comprising administering an 

' effective amount of an antibody to the relevant DGATl . 

25 43. A method of modulating milk production and/ or milk solids content by raising 
an autoantibody to a bovine DGATl in the bovine. 

44. A transgenic animal having cells that contain a transgene, which transgene is 
introduced into the animal or an ancestor of the animal at a prenatal stage, wherein 

30 said transgene comprises a bovine cDNA comprising the nucleotide sequence of SEQ ID 
NO: 4 or an appropriate variant or subsequence thereof. 

45. A transgenic non-human animal wherein said endogenous DGATl gene has 
been knocked out and replaced with a transgene. 

35 

46. A transgenic non-human animal, wherein additional copies of the gene 
encoding DGATl are inserted as a transgene. 
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47. A transgenic non-human animal as claimed in claim 45 comprising a 
transgenic bovine having a genome lacking a gene encoding a protein having biological 
activity of DGATl. 

5 

48. A transgenic non-human animal comprising a mouse having a genome 
containing a gene encoding a bovine protein having biological activity of DGATl. 

49. A transgenic bovine having a gene encoding a bovine protein having biological 
10 activity of bovine DGATl and heterologous nucleotide sequence antisense to the gene. 

50. A transgenic bovine having a genome which includes additional copies of a gene 
encoding a protein having biological activity of DGATl or copies of a gene encoding a 
protein having biological activity of DGATl under control of a high expression promoter. 

15 




FIGURE 1 
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<222> (29).. (31) 

<223> bases 1 to 3 of the Kozak recognition sequence. See the genomic s 
equence from the start codon for bases 4 to 7 of the Kozak recogn 
ition sequence or the DGATl cDNA for the complete recognition seq 
uence . 



acttggccgc ggcggggtgc gaactaaggc c 



FIGURE 2a 
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<210> 1 
<211> 11771 
<212> DNA 
<213> Bos taurus 

<220> 

<221> CDS 

<222> (1) . . (191) 

<223> Exon 1 CDS, determined by alignment with an amino acid sequence d 
educed from the cONA 

<220> 

<221> misc^signal 
<222> (1)..(4) 

<223> these bases correspond to bases 4 to 7 of the Kozalc recognition s 
equence. See DGATl cDNA for the con¥)lete recognition sequence. 

<220> 

<221> CDS 

<222> (3809) . . (3896) 

<223> Exon 2, determined by alignment with an amino acid sequence deduc 
ed from the cDNA 

<220> 

<221> CDS 

<222> (5840) . .'(5880) 

<223> Exon 3, determined by alignment with an amino acid sequence deduc 
ed from the cDNA 



FIGURE 2b 
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<220> 

<221> CDS 

<222> (5960) . . (6045) 

<223> Exon 4, determined by alignment with an amino acid sequence deduc 
ed from the cDNA 

<220> 
<221> CDS 

<222> (6138) (6190) 

<223> Exon 5, determined by alignment with an amino acid sequence deduc 
.ed from the cDNA 

<220> 

<221> CDS 

<222> (6406) . . (6511) 

<223> Exon 6, determined by alignment with an amino acid sequence deduc 
ed from the cDNA 

<220> 

<221> CDS 

<222> (6601) (6714) 

<223> Exon 7, determined by alignment with an amino acid sequence deduc 
ed from the cDNA 

<220> 

<221> CDS 

<222> (6815) . . (6889) 

<223> Exon 8, determined by alignment with an amino acid sequence deduc 
ed from the cDNA 



FIGURE 2b continued 
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<220> 
<221> 
<222> 
<223> 

<220> 
<221> 
<222> 
<223> 

<220> 
<221> 
<222> 
<223> 

<220> 
<221> 
<222> 
<223> 

<220> 
<221> 
<222> 
<223> 



5/34 

variation 
(6824) . . (6889) 

this secpience is deleted from the alternately spliced transcript. 



CDS 

(6960) .. (7063) 

Exon 9, determined by alignment with an amino acid sequence deduc 
ed from the cDNA 



CDS 

(7154) . . (7192) 

Exon 10, determined by alignment with an amino acid sequence dedu 
ced from the cDNA 



CDS 

(7271) (7312) 

Exon 11, determined by alignment with an amino acid sequence dedu 
ced from the cDNA 



CDS 

(7386).. . (7430) 

Exon 12, determined by alignment with an amino acid sequence .ded 
uced from the cDNA 



FIGURE 2b continued 
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<220> 

<221> CDS 

<222> (7505) . . (7617) 

<223> Exon 13, determined by aligimient with an amino acid sequence dedu 
ced from the cDNA 

<220> 

<221> CDS 

<222> (7705) . . (7770) 

<223> Exon 14, determined by alignment with an amino acid sequence dedu 
ced from the cDNA 

<220> 
<221> CDS 

<222> (7858) . ; (7945) 

<223> Exon 15, determined by alignment with an amino acid sequence dedu 
ced from the cDNA 

<220> 

<221> CDS 

<222> (8027) .. (8089) 

<223> Exon 16, determined by alignment with an amino acid sequence dedu 
ced from the cDNA 

<220> 

<221> CDS 

<222> (8162) (8314) 

<223> Exon 17 CDS, determined by alignment with an amino acid sequence 
deduced from the cDNA 



FIGURE 2b continued 
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<221> 
<222> 
<223> 

<220> 
<221> 
<222> 
<223> 

<220> 
<221> 
<222> 
<223> 

<220> 
<221> 
<222> 
<223> 

<220> 
<221> 
<222> 
<223> 

<220> 
<221> 
<222> 
<223> 



7/34 



mis cofeature 
(8315) . . (8317) 
translation stop codon 



polyA_site 
(8572) (8578) 



variation 
(626) . . (626) 

thymidine (T) to guanine (G) substitution polymorphism 

variation 
(3512) (3512) 

thymidine (T) to guanine (G) substitution polymorphism 

variation 
(4040) . . (4040) 

thymidine (T) to cytosine (G) substitution polymorphism 

variation 
(4963) .. (4963) 

adenine (A) to guanine (G) substitution polymorphism 



FIGURE 2b continued 
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<220> 
<221> 
<222> 
<223> 

<220> 
<221> 
<222> 
<223> 

<220> 
<221> 
<222> 
<223> 



<220> 
<221> 
<222> 
<223> 

<220> 
<221> 
<222> 
<223> 



8/34 

variation 
(5003) (5003) 

guanine (G) to adenine (A) substitution polymorphism 

variation 
(5997) (5997) 

cytosine (C) to thyamine (T) substitution polymorphism 



variation 
(6829) . . (6830) 

adenine (A) -adenine (A) to guanine (G) -cytosine (C) substitution p 
olymorphism 

AA corresponds to the Q allele 
GC corresponds to the q allele 



variation 
(6892) . . (6892) 

guanine (6) to adenine (A) substitution polymorphism 



variation 
(7224) (7225) 

guanine (G) -guanine (G) to adenine (A) -cytosine (C) substitution 

po 1 ymo r phi sm 

GG-AC 



FIGURE 2b continued 
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<221> 
<222> 
<223> 

<220> 
<221> 
<222> 
<223> 

<220> 
<221> 
<222> 
<223> 

<220> 
<221> 
<222> 
<223> 

<220> 
<221> 
<222> 
<223> 

<220> 
<221> 
<222> 
<223> 



9/34 

variation 
(7438) . . (7438) 

adenine (A) to guanine (G) substitution polymorphism 



variation 
(7456) . . (7456) 

cytosine (C) to thymidine (T) substitution polymorphism 

variation 
(7987) . . (7987) 

guanine (G) to adenine (A) substitution polymorphism 

variation 
(8402) . . (8402) 

ctosine (C) to thymidine (T) substitution polymorphism 



misc_f eature 
(9434) . . (9434) 
ambiguous nucleotide 



misc^f eature 
(9496) . . (9496) 
ambiguous nucleotide 



FIGURE 2b continued 



wo 02/36824 



PCT/NZOl/00245 



<220> 
<221> 
<222> 
<223> 

<220> 
<221> 
<222> 
<223> 

<220> 
<221> 
<222> 
<223> 

<220> 
<221> 
<222> 
<223> 

<220> 
<221> 
<222> 
<223> 



10/34 

misc_feature 
(10402) (10417) 
ainbiguous nucleotides 



priiner_binci 
(6579) (6601) 
Primer 17F 

.CCTGAGCTTGCCTCTCCCACAGT 



primer_bind 
(7036) . . (7058) 
Primer 18R 

CCAGGAGTCGCCGCAGCAGGAAG 
reverse primer 



primer^bind 
(7280) . . (7303) 
Primer 6F 

CCGGCCATCCAGAACTCCATGAAG 



prime r_bind 

(7585).. (7605) 

Primer AW44 6985 dnl 
TAGAACTCGCGGTCTCCAAAC 
reverse primer 
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<220> 

<221> primer_bind 

<222> (8222) . . (8244) 

<223> Primer InsDpl 

TGGCTGTCACTCATCATCGGGCA 

<220> 

<221> primer_bind ■ 

<222> (8566) . . (8589) 

<223> Priaier 14R2 

TTGCACAGCACTTTATTGACACA 

<220> 

<221> primer^bind 

<222> (6813) . . (6830) 

<223> Primer SNPl^FAM 

AGC TTT GGC AGG TAA GGC 

<220> 

<221> prime r_bind 

<222> (6813) (6830) 

<223> Primer SNP1_HEX 

AGC TTT GGC AGG TAA GAA 

<220> 

<221> prime r_bind 

<222> (5831) . . {6844) 

<223> Primer SNP1_2P 

GGC CAA CGG GGG AG 
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<220> 

<221> primer^bind 

<222> (7424) (7438) 

<223> Primer SNP2 FAM 

GCT GGC GGT^'gaG TGA 

<220> 

<221> prime r_bind 

<222> (7424) (7438) 

<223> . Primer SNP2_HEX 

GCT GGC GGT gag TGG 

<220> 

<221> prime r_bind 

<222> (7439) . . (7453) 

<223> Primer SNP2_2P 

CCT GCT GGG TGG GGA 

<220>. 

<221> primer_bind 

<222> (7442) (7456) 

<223> Primer SNP3_FAM 

GCT GGG TGG GGA CGC 

<220> 

<221> primerjbind 

<222> (7442) . . (7456) 

<223> Primer SNP3_HEX 

GCT GGG TGG GGA CGT 
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<220> 

<221> prinier_binci 

<222> (7457) . . (7470) 

<223> Primer SNP3_P 

GTG GGG GC6 GGT GG 



<220> 

<221> primer^bind 

<222> (8388) . . (8402) 

<223> JPrimer SNP4_FAM 

TGC CCC AAC CTG GGT 



<220> 

<221> primer_bind 

<222> (8388) . . (8402) 

<223> Primer SNP4_HEX 

TGC CCC AAC CTTG GGC 



<220> 

<221> primer_bind 

<222> (8403) . . (8417) 

<223> Primer SNP4_2P 

GCA GCA GGA GGA GGC 

<220> 

<221> primer_bind 

<222> (6811).. (6830) 

<223> Primer. Dgat 21 

. GTAGCTTTGGCAGGTAAGAA 
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<220> 

<221> primer_bind 

<222> (6965) . . (6984) 

<223> Primer Dgat 22 

GGGGCGAAGAGGAAGTAGTA 
reverse primer 

<220> 

<221> primer Jbind 

<222> (6613) . , (6632) 

<223> Primer Dgat 23 

TGGCCCTGATGGTCTACACC 

<220> 

<221> ' primer_bind 

<222> (6829) (6850) 

<223> Primer Dgat 24B 

GGGCAGCTCCCCCGTTGGCCGC 
reverse primer 

<220> 

<221> priiner_bind 

<222> (6651) . . (6670) 

<223> Primer DgatforAD 

TTCTCCTACCGGGACGTCAA . 



<220> 

<221> primer_bind 

<222> (6871) , . (6890) 

<223> Primer ReverseNZ 

CCGCGGTAGGTCAGGTTGTC 
reverse primer 
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<220> 

<221> primerjDind 

<222> (6825) (6838) 

<223> Probe ForAA (FAM) 
CGTTGGCCTTCTTA 

<220> 

<221> primer_bind 

<222> (6823) . . (6836) 

<223> Probe DgatADGC (VIC) 
TTGGCCGCCTTACC 

<220> 

<221> primerjt>ind 

<222> (6651) . . (6670) 

<223> Primer DgatforAD 

TTCTCCTACCGGGACGTCAA 

<220> 

<221> primer^bind 

<222> (6878) . . (6972) 

<223> Primer DgatrevAD 

AAGTAGTAGAGATCGCGGTAGGTCA 
reverse primer 

<220> 

<221> primer__bind 

<222> (6825) . . (6838) 

<223> Probe ForAA (FAM) 
CGTTGGCCTTCTTA 
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<220> 

<221> primerjbind 

<222> (6823).. (6836) 

<223> Probe DgatADGC (VIC) 
TTGGCCGCCTTACC 

<220> 

<221> primerjbind 

<222> (6652) .. (6671) 

<223> Priiaer DgatforRT66 

TCTCCTACCGGGACGTCAAC 

<220> 

<221> primerjbind 

<222> (6874) .. (6964) 

<223> Primer DgatrevRT66 

GAGATCGCGGTAGGTCAGGTT 
reverse primer 

<220> 

<221> primer_bind 
<222> (6711) (6972) 

<223> Primer DgatforRTless 66, GCTGCTTTGGCAGATCTCTACTACTT 

This primer was designed to selectively bind and amplify the cDNA 
splice variant. The corresponding binding site in this genomic s 
equence comprises bases 6711 to 6715, 6815 to 6823 and 6960 to 69 
72. 

<220> 

<221> primerjbind 

<222> (7022) (7038) 

<223> Primer DgatrevRTless66 
AAGCGCTTTCGGATGCG 
reverse primer 
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<220> 

<221> primer_bind 

<222> (6857).. (6870) 

<223> Probe Dgatwith66 (FAM) 
CCGTGAGCTACCC 

<220>- - . 

<221> primer^bind 

<222> (6976) . . (6990) 

<223> Probe Dgatiess66 (VIC) 
CTTCGCCCCCACCCT 



<400> 1 

atg ggc gac cgc ggc ggc gcg ggc ggc tec egg cgc egg agg acg ggg 48 

Met Gly Asp Arg Gly Gly Ala Gly Gly Ser Arg Arg Arg Arg Thr Gly 

1 5 10 15 

teg egg cct teg ate eag gge gge agt ggg ecc geg gea gcg gaa gag 96 

Ser Arg Pro Ser lie Gin Gly Gly Ser Gly Pro Ala Ala Ala Glu Glu 
20 25 30 

gag gtg egg gat gtg ggc gee gga ggg gac geg ccg gte egg gac aca 144 

Glu Val Arg Asp Val Gly Ala Gly Gly Asp Ala Pro Val Arg Asp Thr 
35 40 45 



gac aag gac gga gac gta gac gtg ggc age gge cac tgg gac etg ag 191 
Asp Lys Asp Gly Asp Val Asp Val Gly Ser Gly His Trp Asp Leu Arg 
50 • 55 60 



gtagcggtge gcgtgacccc 


taacctttga 


cccctgatac 


ggggcccctg 


cgacccaaec 


251 


tggtggccca ggcctgtcgg 


cggeagcteg 


ggctcgagtc 


cgagagtctg 


gcgcctggac 


311 


ettggtgeae agctgtgccc 


etcgggectc 


cacggggaaa 


cttagcggga 


ggttgggggc 


371 


ggagggtetc ctgcccggaa 


cacccaggta 


cgggggecga 


ggggagggca 


gcggctcaac 


431 


ttctagacgc cctccctctg 


cettcctttg 


gtgggttctg 


aagctttccc 


agggtgagee 


491 


eaetacgeac. agtgtcctct 


acetggaagg 


agatacaggg 


gtccttcctg 


agggctatga 


551 


ggggtgcett gtgggttgat 


aaagctcccg 


ggggaggagg 


gtggaceggc 


ggagaacaga 


611 


ggcaggggca gtgctagggg 


atttctcatc 


cctcgcagae 


cetccagaga 


atggtcttca 


671 


caaaggtece tcatccgtca 


cecggcgatt 


gactggccta 


ggatcctgct 


tattaccage 


731 


acaaatgget gctetagggt 


eaaagtgggt 


cctgtaatgg 


gaceetcacc 


cctggttggg 


791 
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gtacagggga 


ggagttggaa 


gtgcgcacac 


ccacaggtgg 


gcgccctgct 


tagctgaagg 


851 


actgatggga 


aggagttggg 


ggagcaagct 


gcggctgaaa 


gggaggatct 


gacccacgtg 


911 


ggcatcagct 


aagtcctgct 


ggctgcctcc 


aggcgttccc 


tttgccatcc 


tccacgcccc 


971 


tccccco'ggq 


cctgaccttc 


atcctggtca 


agggctctca 


ggggctctgg 


ttttgggatc 


1031 


agctccagag 


ctagaggtta 


tcaaggagga 


agtgggcaac 


aggtcagtca 


gcaaggattt 


1091 


gctatcttca 


ctcracT'tactcr 


*" 3 3 3 TS **33 3 3 


agggacaagg 


gcagttgggg 


tgcaggcact 


1151 


gtccctgccc 


^ticracTGcraca 

%m y yy y V«C* 


cacag't'tcac 


ctgagagata 


agatagccgc 


agccctgaag 


1211 


agtgagagca 


aaggtcaggc 


acagagttca 


gga'tgacacc 


'*3333**333 


ggct ctgtga 


1271 


QQQQC3.Ct.aa 


cttcctacag 


gccccaggtig 


crkccbaaagg 


gocaactaca 

3 3 ^3 3 3 


aaggccagga 


1331 


ggcccacagg 


ccccbctgcc 


cactcctggg 


gaac'tggat't 


taaoCTtcact 


tt at at era OCT 


1391 


t crooaQcccrcf 


taccagcttt 


gggccaagct 


gtcaccctgg 


atgggccatc 


acttgcctgc 


1451 


tctgtatagg 


ccaga'tggcc 


agaagct^ct 


cc'tg'tcctg't 


't.ga'tggccca 


tcctcgaggt 


1511 


Qtggaccctc 


gggaagagga 


gcagttggtg 


3 y 3 *" 3 3 


gccaccggag 


accctcctga 


1571 


cctccaggac 


acgcagctgt 


g'tg^gcc'tgti 


ccccaagcca 


catigccacati 


ggctaggggc 


1631 


cbcctggggc 


Aaaactaaac 


attggtctgg 


ctactcttgg 


t a ti c gc ct at^ 


ggc't'tccclig 


1691 


cc'tcccag'^c 


a.tcatcctcc 


cacctctgcc 


'tccc'tgcc'tg 


ttcctctctt 


tclicctcagg 


1751 


ccctliccgga 


ca 1 1 1 c nt* crc 


tcacctacjgt 


ctgggcaggc 


aaadti ca.cf ati 


crcccrocrtcrtci 

ywwy wy ^3 


1811 


agctcactcc 


t'tccggcagc 


aagg'tgtiagc 




aaggaaggcc 


gctgctgttg 


1871 


cctcgcctct 


gagtgcatcc 


cttccaggtc 


ctccacactc 


ccctgtgccc 


cgacacctgg 


1931 


tgcgtccttc 




'bca'tg'tg'tcc 


tccaggcaca 


gc't't'tctiagt 


ccagagcctc 


1991 


taggctgggt 


gcaggaag'hg 


ctgaggaagt 


aacagccggg 

WC2^WW«^«j «J 


aggcgagctg 


gcaccctgtc 


2051 


cctccttgtt 


ctgtccgtcc 


ctgcccctgg 


accg'tatggc 


cccgcatgtg 


tgatccccac 


2111 




gcctctgggc 


aaali'tgaaaa 


ac'ttgg'tgaa 

3** '■33 **3**3 


cctcattttic 


atgtgcccgc 


2171 


ctcccagtac 


tgatgtgcag 


gttgaatgag 


gtgccaactg 


taatgagttg 


gaatggccct 


2231 


gctggctggg 


tgggactggg 


gagcaggtgg 


gggccgctgg 


ggggcacaga 


ggcacaccca 


2291 


gtgcctcagt 


cagggagagg 


gtgacagaga 


agctctgggt 


gaggccccac 


ctccactctg 


2351 


gccatggctg 


ctgccctttg 


gtccactgca 


gtgaactgtg 


ccatggggct 


ggacctctgt 


2411 


ggggattggt 


gggcagtggg 


ctttcttccc 


gcttggggcc 


tctgacctct 


gggggcaggg 


2471 


cgctgcccgg 


gtgggacagt 


cggaaggctg 


gtagagggac 


ctgaggggtc 


tgtgtggtgg 


2531 


ctgggggcag 


gcctcaggaa 


tttgacagca 


gggatctgga 


aaagctttaa 


taacattatt 


2591 
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tgttgtcagg attgggaaat gctcccctcc cccctccccc 


tctttcatct 


tagagactgc 


2651 


tgcacatctg gtcagtgtgg tcttcttggt ggcccccaag 


gtggcagggg 


tcacactgtt 


2711 


atgaaaccgt cccctgggta tgtggtgcag acatgcacat 


gcagatggtg 


attggcaggt 


2771 


tgtagcatga ggtggctttg ggacggttcc agtgacagtg 


agtgggctgg 


atctgggggg 


2831 


ttctgggcag gtccatcaag cggatacccc cacagactgt 


cctcttggga 


tagttgggcc 


2891 


tgggagccct gcttgccttg ccaaaaggca ggcgcagagt 


catgaagaag 


agggcttggg 


2951 


ggctcagagc cccactgtgt gtgcagccca gggtggacct 


ggaggaggtg 


cgtgggcagg 


3011 


ctgggccggc cggggcgggg ggtggggggg cctggtgtga 


aagggaccca 


gggccagact 


3071 


gtcagcgctg cctggctgag gatgctggca ccctgtcctc 


cccagccgtc 


tgtctcctgg 


3131 


gtgcagccat ctgagtgctg accccagccg cccctggagg 


ctggctgttc 


tcctgtgccc 


3191 


tattgctggg gacatgtgtc cacaggaggg aaagggaagc 


cccggcctct 


ccccttacaa 


3251 


aactggaggc cttgctcaat gccctggatg gcctcctggt 


ggcagggtgg 


ttggtgggag 


3311 


gtggggctgc tgcttagaac ccgccagcgg gcctgggcct 


gggctgagct 


gcacccctcc 


3371 


acctctgcct ccagctgagg gttggcttcc atctccacca 


ggcccagcac 


tgggcacagg 


3431 


gctctcagag gcaggctctg aaagtcccct gctggcttct 


gcagtggact 


ccaggcgccg 


3491 


agcccccagg gggctcgcat tgcgctcacc ctgcgaagcc 


acgtgaaggc 


tgggtcctcc 


3551 


cctccggaag ggccaaatgc agggcatggg tggtttgaat 


ggtggcccct 


gggctccccg 


3611 


gagggaccag ctgctgtgag ggccgccccc tccccacttc 


cgtcttgcat 


caccagctcc 


3671 


tgtggcactc cccacgcccc gtcccccagt gggagcggca 


ggcccccggt 


ggctctgccc 


3731 


gcggaggggg atgtgtgggc ggcggggtgg ccttgctgcc 


agatgctctg 


ccccgagtgt 


3791 


ccgtctccgc tctccag g tgt cac cgc ctg cag gat tec ctg ttc agt tct 


3842 



Cys His Arg Leu Gin Asp Ser Leu Phe Ser Ser 
65 70 75 



■ gac agt ggc ttc age aac tac cgt ggc ate ctg aat tgg tgt gtg gtg 3890 
Asp Ser Gly Phe Ser Asn Tyr Arg Gly He Leu Asn Tip Cys Val Val 
■ 80 85 90 

atg ctg gtacgtagag tgacaccttg gagcaagggt cctgacggcc ggggggccat 3946 
Met Leu 

gggctcttct ccaggggtag gtgtctgtac ttgtgtagct gtggtgaatg gagctctgtg 4006 

ctggcggtgg gggtccctgg agcagccgta ccctgggacc ctaccgggag catgctcatg 4066 

ccgtccctgc tgaatcccag gagatgcctg cagagggcag cctgggagcc tctgagctgg 4126 

ggtctgcgcc ccagggggca ctggagtctc cccagggggc gagagagagt aggcagggat 4186 
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ggtctggtgg ccctgggtgg gggatggctg ctccgtgggc ccaggccctc cctggcagca 424 6 

caggtgagtg gtcttggggg tccacgtaga acttcctctt ctgttccaaa ttgccctcat 4306 

gggtgcggca tgcctgggtg aacctggggg agcagggtga ggacatgctt ctcagcccag 4366 

cccacagctc caggccacac tctgcaggac tctggcccct ccctcagccc tggagggagc 4426 

aggactggag tcctgtgtcc gccttgctct gacctggccg aggccactgc tgtggggccc 4486 

cagcaggcct gcccagcaga aggtggagtg cagggacccc aggggcagcc ttcagggtgg 4546 

ggcagggtga ggcccgactg ggcccagccc caccgctcag tgctgatgtg gcgcgaggcc 4606 

ttcgcc.cctc cagctgacgt gtctgcctgc cctgggtgtg gctccagagg ctgcctgtgt 4666 

accaggggcc cccacgcttc tgtttgtggt tctgggcagt cccctgggga gcggtggggg 4726 

ctgtgtgcca gtccagaccc agtagtccac gcgtcctggt ctctggaggc cgtggctggt 4786 

ccaggactgt ggcaaggtgg tcgtgcaggg caggccctca gcagcctgtc tgttctcctg 4846 

cagcccccag cctcctggcc ctttggtgca cccacaaagc tcccccctcc cccaggagct 4906 

ggggccgcct gctgcgtcct ctcggcagcc tgggcttcca ggtggctggg cctcttagca 4966 

gctccaactc ttgcctgtgg tgggctctca ggacaggcaa ctgccagtcg gcagacattg 3026 

caggaccacg tgtgtcctgg taagctggct ggttaggtgt ttagctgggg gatggtgtgg 5086 

caggtggccc ctgcatctct gagcctgtca cctcctcggg aagccttctg ggtgggggac 514 6 

tccacccatg tcgcctggag aagcatcact tttccacaga gccttctgca acccccgtgg 5206 

ggcctgagcc tggggtgggg gaggtggtgg cccctgctcc tgcagaggcc agccaggcat 5266 

ctggccccag gccactggca agagctcgtt gtgttggggg atctgtcctt tgctgctgct 5326 

gcaggagcgg ccgaggcagg cgggggcgtg agtaggggtg gagacccagg cccagcttcc 5386 

ccagcccctc aggaccggcc tgctctttcc caccacccca ccaagtgcgt gggcacaccc 544 6 

cgcctgtgag gatgggcccg gttggcaggg cggagccctg ggagggtggc agtgcgccgg 5506 

gcaggcttgg acttcactgg ggcttggggt tgtcgctgtg gccaggggcg ctgacccgct 5566 

tggtgggacg gacggccgct gggcagcagg tttcttctgc cacggtggca caggcacctg 5626 

gggttgtggt tggctccagg cgggcggggg ctgcgtgccc ctgcgcaggc acataggccg 5686 

tgggtgggga gtctcagagc ttggcgtgag gtcccacagg gctgggcctg caggatggag 574 6 

gccactgtcc tgagctgcag gtgctggcag gagctggggt gggcgttctg gggccgtggc 5806 

tgacagcgtt atgtccctct ctctctatcg cag ate tta age aac gca egg tta 5860 

lie Leu Ser Asn Ala Arg Leu 
95 100 

ttt eta gag aac etc ate aa gtgagtgggc cccggectgc cccagcccct 5910 
Phe Leu Glu Asn Leu lie Lys 
105 
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gccacctcac ccctcgccta cacagaccct cacccacctg cgtctgcag g tat ggc 5966 

Tyr Gly 



ate ctg gtg gac ccc ate cag gtg gtg tct ctg ttc ctg aag gac ccc 6014 
lie Leu Val Asp Pro He Gin Val Val Ser Leu Phe Leu Lys Asp Pro 
110 115 120 125 

tac age tgg cca get ctg tgc ctg gtc att g gtgagctggg tgcccaggag 6065 
Tyr Ser Trp Pro Ala Leu Cys Leu Val He 
130 135 

geetcaggcc ggeggtgggt gggaeagggc tgatctggge ctgaaeetgc cctgggttgc 6125 

ttetgtccte ag tg gcc aat ate ttt gee gtg get geg ttc cag gtg gag 6175 
Val Ala Asn He Phe Ala Val Ala Ala Phe Gin Val Glu 
140 145 

aag cgc ctg gcc gtg gtaagcagtg ccctcacgcc eteecctgae ttgcctcaag 6230 
Lys Arg Leu Ala Val 
150 

gtccttacca gtcgggctta gggcgggcca ceagctggte ccactgtgct tcagggtttt 6290 

gggcctttcg tggccttcct gagaggggct gcacctcagg cctggtggct cttcctcagg 6350 

gaggtcctct gaccagggag gggggtccct ggctgacgct etgetcccac cccag gga 6408 

Gly 

get ctg acg gag cag geg ggg ctg ctg ctg cac ggg gtc aac ctg gcc 6456 
Ala Leu Thr Glu Gin Ala Gly Leu Leu Leu His Gly Val Asn Leu Ala 
155 160 165 170 

ace att etc tge ttc cca geg gcc gtg gcc ttt etc etc gag tct ate 6504 
Thr He Leu Cys Phe Pro Ala Ala Val Ala Phe Leu Leu Glu Ser He 
175 180 185 

act cca g gtgggeeeca eceecgeeec egececcgec eacgetgtct eggeeaeggg 6561 
Thr Pro 

eagcgcgggg ggcgtggcct gagcttgcct cteccacag tg ggc tec gtg ctg 6614 

Val Gly Ser Val Leu 
190 

gcc ctg atg gtc' tac ace ate etc ttc etc aag ctg ttc tec tac egg 6662 
Ala Leu Met Val Tyr Thr He Leu Phe Leu Lys Leu Phe Ser Tyr Arg 
195 * 200 205 

gae gtc aac etc tgg tge ega gag ege agg get ggg gcc aag gee aag 6710 
Asp Val Asn Leu Trp Cys Arg Glu Arg Arg Ala Gly Ala Lys Ala Lys 
210 ' 215 220 225 

get g gtgagggctg cetegggctg gggecactgg gctgceactt gectegggac .6764 
Ala 

eggcaggggc teggcteace ecegacecgc eccctgecgc ttgetcgtag et ttg 6819 

Ala Leu 
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gca ggt aag aag gcc aac ggg gga get gcc cag cgc acc gtg age tac 6867 
Ala Gly Lys Lys Ala Asn Gly Gly Ala Ala Gin Arg Thr Val Ser Tyr 
230 * 235 240 

ccc gac aac ctg acc tac cgc g gtgaggatcc tgccgggggc tggggggact 6919 
Pro Asp Asn Leu Thr Tyr Arg 
245 , 250 

gcccggcggc ctggcctgct agccccgccc tcccttccag at etc tac tac ttc 6973 

Asp Leu Tyr Tyr Phe 
255 

etc ttc gcc ccc acc ctg tgc tac gag etc aac ttc ccc cgc tee ccc 7021 
Leu Phe Ala Pro Thr Leu Cys Tyr Glu Leu Asn Phe Pro Arg Ser Pro 
260 265 270 

cgc ate ega aag cgc ttc ctg ctg egg ega etc ctg gag atg 7063 
Arg lie Arg Lys Arg Phe Leu Leu Arg Arg Leu Leu Glu Met 
275 280 285 

gtgaggcggg gcctcgcggg ccagggtggg cgggectgcc ggcacccggc accggggctc 7123 

agetcaetgt ccgettgctt cettccccag ctg ttc etc acc cag etc cag gtg 7177 

Leu Phe Leu Thr Gin Leu Gin Val 
290 

ggg ctg ate cag cag gtacgtgccc gggggggggg gggggggggg gggggggact 7232 

Gly Leu lie Gin Gin 
295 

ctggggecgt tggggagctg actctgcgct ttttgcag tgg atg gtc ccg gcc ate 7288 

Trp Met Val Pro Ala He 

300 305 

cag aac tec atg aag ccc ttc aag gtgagcaggc aggectggca gggtgggtte 7342 
Gin Asn Ser Met Lys Pro Phe Lys 
310 

cggggteagg gctgagggag ecagctgtgc cctgtgccca cag gac atg gac tac 7 397 

Asp Met Asp Tyr 

315 

tec cgc ate gtg gag cgc etc ctg aag ctg gcg gtgagtgacc tgctgggtgg 7450 
Ser Arg lie Val Glu Arg Leu Leu Lys Leu Ala 
320 325 

ggacgcgtgg gggegggtgg ggctgttetg gcacctggca eccactcccc acag gtc 7507 

Val 



ccc aac cac etc ate tgg etc ate ttc ttc tac tgg etc ttc cac tee 7555 

Pro Asn His Leu lie Trp Leu He Phe Phe Tyr Trp Leu Phe His Ser 
330 335 340 345 

tgc ctg aac gcc gtg get gag etc atg cag ttt gga gac cgc gag ttc 7603 

Cys Leu Asn Ala Val Ala Glu Leu Met Gin Phe Gly Asp Arg Glu Phe 
350 335 360 
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tac egg gac tgg tg gtgggtggcc ttgccggggc gggggtggtg ggggcccccg 
Tyr Arg Asp Trp Trp 
365 

ctggggctgg ggccggagcc cctgcccact ctgccccgcc cccgcag g aac tec gag 

Asn Ser Glu 



tec ate aee tac ttc tgg eag aac tgg aac ate cct gtt cac aag tgg 
Ser lie Thr Tyr Phe Trp Gin Asn Trp Asn He Pro Val His Lys Trp 
370 375 380 385 

tgc ate ag gtgggtgtgc gectgggggc ggggggttgg ggggtgggae 
Cys He Arg 



ggggtcgcgt ggcccgggcg eccagceeac tgccgcctec cccgcag a cac ttc tac 

His Phe Tyr 
390 

aag ccc atg etc egg egg ggc age age aag tgg gea gee agg aeg gea 
Lys Pro Met Leu Arg Arg Gly Ser Ser Lys Trp Ala Ala Arg Thr Ala 
395 400 405 

gtg ttt ctg gee tec gee ttc ttc cac gag gtcagtgcac tgagggcgcg 
Val Phe Leu Ala Ser Ala Phe Phe His Glu 
410 415 

eectgcccet ggtgggggtg ggggtggggg tgggggcteg ctgacgecec tetcccctea 

g tac ctg gtg age ate ccc ctg cgc atg ttc ege etc tgg gee ttc aee 
Tyr Leu Val Ser He Pro Leu Arg Met Phe Arg Leu Trp Ala Phe Thr 
420 425. 430 

ggc atg atg geg eag gtgagcagec ctggacccce gctcegecce geccegegag 
Gly Met Met Ala Gin 
435 

egcagaggct cactcccgtc ctgtgtcecc ag ate ccg ctg gee tgg ata gtg 

He Pro Leu Ala Trp He Val 
440 445 

ggc cgc ttc ttc cgc ggc aac tac ggc aac geg gee gtg tgg ctg tea 
Gly Arg Phe Phe. Arg Gly Asn Tyr Gly Asn Ala Aia Val Trp Leu Ser 
450 455 460 

etc ate ate ggg eag ccg gtg gee gtc ctg atg tac gtc cac gac tac 
Leu He He Gly Gin Pro Val Ala Val Leu Met Tyr Val His Asp Tyr 
465 470 475 

tac gtg etc aac cgt gag geg ccg gea gicc ggc ace tgagegcete 
Tyr Val Leu Asn Arg Glu Ala Pro Ala Ala Gly Thr 
480 465 

eaggctggee cectcgtggg tgttggaetg ctttgccgcg ctgcctgcgg ctggactaga 

gcctgcccca acctgggcgc ageaggagga ggcctggetg gtggaagctg cetcctggcc 

.tccaccaggc ctctgcctga agggcttcct cctgccaggg gagageaggc ecgacgcagt 

tctggcccct gggaggtgec catgctctgg aaaccctaea gatctcgcee aagggtctga 
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atgtgtcaat aaagtgctgt gcacagtgag ctccctcagc ctccagggca cagggctggc 8624 

aggagggggc ggccctccca cgtggggcca tgctgtggga aggaggcccc agcgcctgga 8684 

gaggagctgg ggctgtggtg accctccctg cctcacaggg ctctgtggtc agacgtcttg 8744 

ccctgcaagg tggagactcc atgctccaag gccccctgtg cctgaggtct gcacacaagt 8804 

ggattcaact tgggtcaggc cagaggctaa ggtgtggaag aaggttgaga atcaggctga 8864 

cttgaacggc agcaaagact ccaaggcaag gctgcagaag tctcaaaagc tatgcgcaca 8924 

gtcccctgct ggggtgctca cctgggctgg gctctgggct gcttggacaa agcaggtggc 8984 

ctggctcagc cctcaccgag ggccttcctt gggggcagag gttggcctga tgccaggggc 9044 

tccccgtttt tccaggccct cagcaggtag ttgggtgtgg ccctcatgat accttggtcc 9104 

cagagctccg ccactcaaaa agcttggcag tgaggcaagg gcaaccccgg gctgttcccc 9164 

cctctactgg ctctgccgcc tgggttggaa accctgaggc tgtgccaggc aggtgtaccc 9224 

tgacagccag ccatggccca gtaagatggg tgcccgaggt ggtacctggg cagcggaccc 9284 

agctgtgctg cccccgcccc aaccagaagc cgctctagcc catggtggtc gtctgggcga 9344 

gacaggctgg ttggctaggc actgtttggt ctacagcagg tgtaggcagc gtctccctga 9404 

cccctgcctt cHaggaagcc accaccctgn gccctactca tcagcaagga cagcgagcag 9464 

ggctgagctg ggggtgcgtg ggctgctacg gnccccgcac cttcatcaca tgcacctctg 9524 

caccccctgc tgcctgactc aagagtgggg gggggggtcc tgtgcttcct tcattccaga 9584 

cccacggtgc tgacccagtg cacccacctg gtccgctagt gctgacctgg ccacagggct 9644 

cctgtgggcc cacgctgatc ccgccctggt cccttcataa agaactcttg agcacatgca 9704 

gcccagggga gccaggaggc tccagtgtgc tgtgtccatc tgcctccctt cagccccttc 9764 

cgagacactg cgcatcatgc ccccctccac ccccacccac actggcagga ggaacacaca 9824 

gggagaccac acacagagct cgttgtttat aaatctctgc ctggctcatc ggtctgtttg 9884 

tccatgtata tatctgcata tctctatgga aggggaaagg gggactcgtg taaaaatcca 994 4 

aaatacaatt ctatgaacac ctgcatcctg gtcagtctga gtgtggccgt gaagcccagg 10004 

tgagctgtgg ctcacagggc taggccctcg gtgctggccg ggggccactc cccaccccct 10064 

ctccccccct ccgccagcca ggggaccagg ctcctggaca ccaggcctgc ccaaggcctg 10124 

ctctcctcct ggggcttcta cgagacagtg gggtccttgg ctttgggggg ttctgagccc 10184 

gtcagcaggg agatggtggg gtcatctctt atatttcgtc tccctcggag aagtaggagc 10244 

cctcccccag ctcgaagagc accggcaggt cgctgctccc cacgtccacg gagcccgggt 10304 

ccaggagcag caggggctgg gcggtgtagt gcaccagctg cttccctagg ggtgcgactg 10364 
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ggtcagggtg ccggtggggc cggggggcgg ggtggggnim nnnnnimiinn nnnccccccc 10424 

ccccggcccc agccaccccg cctacgcacg ctggccaggc tgctgtccag gtcgggcagg 10484 

ctcatgtcgg gcaccgtaac cgaggggctg aacagctgca gggaagaggg gcgggtcaga 10544 

ctgccctc^ga agcggggcgg gctgggcgcg gggcgggcag ggttagactc acatccagca 10604 

gggtgctggt gtccacgctg aagccatggc ttgtcagcat ggtctgcagg ttgtccaggt 10664 

tggagtccat ggcgtccaag tggtcgctga gctcggtcct ggccagagga aggggagcag 10724 

gtgacgtggc atccaggcac ccccaggtgc agtcctgccg gcccttaagc ccagctgcca 10784 

gcctgccctg cgcaggctca cgggaggccc tgaggtgggg ccctgggcct gggacacttg 10844 

gccaccggtg gtggctcagg ctccctcatc accgtgaggc cccgtaaggc agcacgtcag 10904 

gctgggccgg ctaagacatc agggggctga gttcagggct cagaagggcc cggggcccca 10964 

gcttctcagg ttcagcaacg cctgcccttc acactggagg acagcaggag ggtggagctg 11024 

tgctcggcca ggctgggcca gcctacccca gcatgcgcca ggatagatta ggcctgcccc 11084 

ctcgaggagg ccaggcagta cctcggggtg ctgcctcagg ccccagaggt gggtggggag 11144 

cctgttcaca ggctgggcag gggtctgctc agaggctggg gaggggccca agctctggcg 11204 

gagccctcct ccctccccag tgggacagcg ctaaccctgg ctggactcgg ccatgcagag 11264 

aggaggaggg gcaggggaag aggcgggcga ccccagacat ctttggagtg cgagccaaac 11324 

tgcaagatag aaagacaaga gcccccctct gcgggtgggg cccagcgagt ccaagccccc 11384 

cacdccgttc acccacctct ggggtggtag gacagacaga ccaggcggcc cggggagggc 11444 

ggctcagggg gcaggggtgc ccgccacgtg cctggttccc acggtgcagg ggctgcggga 11504 

ccccctggct ggcatgtctc cctccccagc tccatgctct gggccaggtc ggccagtttc 11564 

ccccagcaaa tcccatgccg agggcctgag ggccatgcgt gtccagcccg gcactgtcct 11624 

tgtcggaggc ccctgcgctt ccagagaccg aagggcgcct ggaaggcact cacttgtcta 11684 

ggcaggcgac gctgaggcac ttctcgggag ccgaggcggg gagggtcgac ggatagcggc 117 44 

cccgagtgat ccgatagaag cttcgta 11771 
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<221> VARIANT 
<222> (232).. (232) 

<223> an amino acid substitution (K -> A) caused by a polymorphism at b 
ases 7224-7225 of the genomic sequence (measured from the adenine 

residue of the translation start codon) . Lysine (K) corresponds 
to the Q allele, alanine (A) corresponds to the q allele. 



<400> 5 

Met Gly Asp Arg Gly Gly Ala Gly Gly Ser Arg Arg Arg Arg Thr Gly 
1 ■ 5 10 15 



Ser Arg Pro Ser lie Gin Gly Gly Ser Gly Pro Ala Ala Ala Glu Glu 
20 25 30 



Glu Val Arg Asp Val Gly Ala Gly Gly Asp Ala Pro Val Arg Asp Thr 
35 40 45 



Asp Lys Asp Gly Asp Val Asp Val Gly Ser Gly His Trp Asp Leu Arg 
50 55. 60 



Cys His Arg Leu Gin Asp Ser Leu Phe Ser Ser Asp Ser Gly Phe Ser 
65 70 75 80 



Asn Tyr Arg Gly He Leu Asn Trp Cys Val Val Met Leu He Leu Ser 
85 90 95 



Asn Ala Arg Leu Phe Leu Glu Asn Leu He Lys Tyr Gly He Leu Val 
100 105 110 



Asp Pro He Gin Val Val Ser Leu Phe Leu Lys Asp Pro Tyr Ser Trp 
115 120 125 



Pro Ala Leu Cys Leu Val He Val Ala Asn He Phe Ala Val Ala Ala 
130 135 140 



Phe Gin Val Glu Lys Arg Leu Ala Val Gly Ala Leu Thr Glu Gin Ala 
145 150 155 160 



Gly Leu Leu Leu His Gly Val Asn Leu Ala Thr He Leu Cys Phe Pro 
165 170 175 
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Ala Ala Val Ala Phe Leu Leu Glu Ser He Thr Pro Val Gly Ser Val 
180 185 190 

Leu Ala Leu Met Val Tyr Thr He Leu Phe Leu Lys Leu Phe Ser Tyr 
195 200 205 



Arg Asp Val Asn Leu Trp Cys Arg Glu Arg Arg Ala Gly Ala Lys Ala 
210 215 220 



Lys Ala Ala Leu Ala Gly Lys Lys Ala Asn Gly Gly Ala Ala Gin Arg 
225 230 235 240 



Thr Val Ser Tyr Pro Asp Asn Leu Thr Tyr Arg Asp Leu Tyr Tyr Phe 
245 250 255 



Leu Phe Ala Pro Thr Leu Cys Tyr Glu Leu Asn Phe Pro Arg Ser Pro 
260 265 270 



Arg He Arg Lys Arg Phe Leu Leu Arg Arg Leu Leu Glu Met Leu Phe 
275 280 285 



Leu Thr Gin Leu Gin Val Gly Leu He Gin Gin Trp Met Val Pro Ala 
290 295 300 



He Gin Asn Ser Met Lys Pro Phe Lys Asp Met Asp Tyr Ser Arg He 
305 310 315 320 



Val Glu Arg Leu Leu Lys Leu Ala Val Pro Asn His Leu He Trp Leu 
325 330 335 



He Phe Phe Tyr Trp Leu Phe His Ser Cys Leu Asn Ala Val Ala Glu 
340 345 350 



Leu Met Gin Phe- Gly Asp Arg Glu Phe Tyr Arg Asp Trp Trp Asn Ser 
355 360 365 



Glu Ser He Thr Tyr Phe Trp Gin Asn Trp Asn He Pro Val His Lys 
370 375 380 



Trp Cys He Arg His Phe Tyr Lys Pro Met Leu Arg Arg Gly Ser Ser 
385 390 395 400 



Lys Trp Ala Ala Arg Thr Ala Val Phe Leu Ala Ser Ala Phe Phe His 
405 410 415 
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Glu Tyr Leu Val Ser lie Pro Leu Arg Met Phe Arg Leu Trp Ala Phe 
420 425 430 



Thr Gly Met Met Ala Gin lie Pro Leu Ala Trp lie Val Gly Arg Phe 
435 . . 440 445 



Phe Arg Gly Asn Tyr Gly Asn Ala Ala Val Trp Leu Ser Leu He He 
•450 455 460 



Gly Gin Pro Val Ala Val Leu Met Tyr Val His Asp Tyr Tyr Val Leu 
465 470 475 480 



Asn Arg Glu Ala Pro Ala Ala Gly Thr 
485 
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<400> 6 

Met Gly Asp Arg Gly Gly Ala Gly Gly Ser Arg Arg Arg Arg Thr Gly 
15 10 15 



Ser Arg Pro Ser He Gin Gly Gly Ser Gly Pro Ala Ala Ala Glu Glu 
20 25 30 



Glu Val Arg Asp Val Gly Ala Gly Gly Asp Ala Pro Val Arg Asp Thr 
35 40 45 



Asp Lys Asp Gly Asp Val Asp Val Gly Ser Gly His Trp Asp Leu Arg 
50 55 60 



Cys His. Arg Leu Gin Asp Ser Leu Phe Ser Ser Asp Ser Gly Phe Ser 
65 70 75 80 



Asn Tyr Arg Gly He Leu Asn Trp Cys Val Val Met Leu He Leu Ser 
85 90 95 



Asn Ala Arg Leu Phe Leu Glu Asn Leu He Lys Tyr Gly He Leu Val 
100 . 105 110 



Asp Pro He Gin Val Val Ser Leu Phe Leu Lys Asp Pro Tyr Ser Trp 
115 120 125 



Pro Ala Leu Cys Leu Val He Val Ala Asn He Phe Ala Val- Ala Ala 
130 135 140 



Phe Gin Val Glu Lys Arg Leu Ala Val Gly Ala Leu Thr Glu Gin Ala 
145 150 155 160 



Gly Leu Leu Leu His Gly Val Asn Leu Ala Thr He Leu Cys Phe Pro 
165 170 175 



Ala Ala Val Ala Phe Leu Leu Glu Ser He Thr Pro Val Gly Ser Val 
180' 185 190 



Leu Ala Leu Met Val Tyr Thr He Leu Phe Leu Lys Leu Phe Ser Tyr 
195 200 205 



Arg Asp Val Asn Leu Trp Cys Arg Glu Arg Arg Ala Gly Ala Lys Ala 
210 215 220 



Lys Ala Ala Leu Ala Asp Leu Tyr Tyr Phe Leu Phe Ala Pro Thr Leu 
225 230 235 240 
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Cys Tyr Glu Leu Asn Phe Pro Arg Ser Pro Arg lie Arg Lys Arg Phe 
245 250 255 



Leu Leu Arg Arg Leu Leu Glu Met Leu Phe Leu Thr Gin Leu Gin Val 
260 265 270 



Gly Leu lie Gin Gin Trp Met Val Pro Ala He Gin Asn Ser Met Lys 
275 280 285 



Pro Phe Lys Asp Met Asp Tyr Ser Arg He Val Glu Arg Leu Leu- Lys 
290 295 300 



Leu Ala Val Pro Asn His Leu He Trp Leu He Phe Phe Tyr Trp Leu 
305 310 315 320 



Phe His Ser Cys Leu Asn Ala Val Ala Glu Leu Met Gin Phe Gly Asp 
325 330 335 



Arg Glu Phe Tyr Arg Asp Trp Trp Asn Ser Glu Ser He Thr Tyr Phe 
340 345 350 



Trp Gin Asn Trp Asn He Pro Val His Lys Trp Cys He Arg His Phe 
355 360 365' 



Tyr Lys Pro Met Leu Arg Arg Gly Ser Ser Lys Trp Ala Ala Arg Thr 
370 375 380 



Ala Val Phe Leu Ala Ser Ala Phe Phe His Glu Tyr Leu Val Ser He 
385 390 395 400 



Pro Leu Arg Met Phe Arg Leu Trp Ala Phe Thr Gly Met Met Ala Gin 
405 410 415 



He Pro Lei4 Ala Trp He Val Gly Arg Phe Phe Arg Gly Asn Tyr Gly 
420 425 430 



Asn Ala Ala Val Trp Leu Ser Leu He He Gly Gin Pro Val Ala Val 
435 440 445 



Leu Met Tyr Val His Asp Tyr Tyr Val Leu Asn Arg Glu Ala Pro Ala 
450 455 460 



Ala Gly Thr 
465 
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